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TITLE OF THE INVENTION 
Compositions, Kits, and Methods for Effecting Adenine Nucleotide 
5 Modvilation of DNA Mismatch Recognition Protehis 
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[0001] This application is a divisional of U.S. Non-Provisional Patent Application Serial 
No. 09/143,571, filed August 28, 1998, and is entitled to priority pursuant to 35 U.S.C. §1 19(e) 
10 to U.S. Provisional Application 60/093,935, filed July 23, 1998, to U.S. Provisional AppUcation 
No.60/066,977, filed November 28, 1997, and to U.S. Provisional Application No. 60/057,136, 
filed August 28, 1997. 

STATEMENT REGARDING FEDERALLY SUPPORTED 
15 RESEARCH AND DEVELOPMENT 

[0002] This research was supported in part by U.S. Government funds (NIH grants numbers 
CA56542 and CA67007 and NRSA grant CA73 134), and the U.S. Government may therefore 
have certain rights in the invention. 

20 REFERENCE TO A MICROFICHE APPENDIX 

[0003] Not applicable. 

BACKGROUND OF THE INVENTION 
[0004] The field of the invention is DNA mismatch protein binding, including animals 
25 useful as models for tumorigenesis, apoptosis, and aging. 

DNA Mismatch Repair 

[0005] The most widely accepted model for DNA post-replication mismatch repair is based 
largely on the model of the DNA adenine methylation (Dam)-Instructed pathway of Escherichia 
30 coU proposed by Modrich (1986, Basic Life Sci. 38:303-310; Modrich, 1987, Ann. Rev. 
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Biochem. 56:435-466; Modrich, 1989, J. Biol. Chem. 264:6597-6600; Modrich, 1991, Anna. 
Rev. Genet 25:229-253; Modrich et al, 1996, Amu. Rev. Biochem. 65:101-133). According 
to this model, the MutS protein recognizes and binds mismatched nucleotides resulting from 
polymerase misincorporation errors to form a MutS-DNA product (Su et al, 1986, Proc. Natl. 
5 Acad. Sci., USA 83:5057-5061; Su et al., 1988, J. Biol. Chem. 263:6829-6835). MutS 

mismatch binding is followed by the interaction of MutL protein with the MutS-DNA product 
(Grilley et al., 1990, Mutat. Res. 236:253-267), which accelerates ATP-dependent translocation 
of the MutS-MutL complex (Allen et al., 1997, EMBO J. 16:4467-4476) to a hemimethylated 
GATC Dam site to which MutH protein is bound (Welsh et al, 1987, J. Biol. Cham. 
10 262:15624-15629; Au et al., 1992, J. Biol. Chem. 267:12142-12148). The MutS-MutL 
■ 3 complex stimulates an intrinsic endonuclease activity of MutH protein, which cleaves the non- 

■f|3 methylated (i.e. more recently replicated) DNA strand (Welsh et al., 1987, J. Biol. Chem. 

r 262:15624-15629; Lahue et al., 1987, Proc. Natl. Acad. Sci. USA 84:1482-1486; Su et al., 

J8 1989, Genome 31:104-111; Cooper et al., 1993, J. Biol. Chem. 268:11823-11829; Grilley et al, 

15 1993, J. Biol. Chem. 268:11830-11837). Strand cleavage enables one of three single-stranded 

'4'- 

i|3 exonucleases of E. coli (RecJ, Exol, ExoVII) to degrade the non-methylated strand, which can 

^ then be re-synthesized by the E. coli PolIII holoenzyme complex (Lahue et al, 1989, Science 

245 : 1 60- 1 64). The net result is a strand-specific mismatch repair event. 
[0006] Many genetic studies performed using E. coli support this interpretation. For 
20 example bacteria having a mutated mutH, mutL^ or mutS gene exhibit a mutator phenotype that 
is presumed to be the result of the increased probability of misincorporation errors leading to 
mutations (Demerec et al., 1957, Bact. Genet. , Carnegie Inst. Wash. Yearbook 370:390-406; 
Miyake, 1960, Genetics 45:755-762; Siegel et al., 1967, J. Bacteriol. 94:38-47; Hill, 1970, 
Mutat, Res. 9:341-344). However, not all predictions arising from the E. coli Dam-instructed 
25 model agree with experimental results. For example, bacteria having a mutation in each of the 
recJ, exol, and exoVII genes do not exhibit a mutator phenotype, suggesting that other 
exonuclease(s) or mechanism(s) are involved in the mismatch repair process. 
[0007] Homologs of the procaryotic MutS and MutL proteins have been identified in 
eukaryotes (Fishel et al., 1993, Cell 75:1027-1038; ProUa et al, 1994, Science 265:1091-1093; 
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Bronner et al, 1994, Nature 368:258-261). MutH analogs appear to exist only in gram-negative 
bacteria. 

[0008] Multiple MutS and MutL homologs have been identified in yeast and human cells 
which individually participate in such diverse activities as nuclear and organelle mismatch 
5 repair as v^ell as distinct meiotic functions (Fishel et al, 1997, Curr. Opin. Genet. Dev. 7:105- 
113). Germ-line mutations of the human MutS and MutL Homologs, hMSH2, hMLHl, and 
hPMS2, have been found to be associated v^ith the common cancer predisposition syndrome, 
hereditary non-polyposis colorectal cancer (HNPCC; Bronner et al., 1994, Nature 368:258-261; 
Fishel et al., 1993, Cell 75:1027-1038). Yeast and human MutS and MutL homologs exist 

10 primarily as heterodimeric proteins. Yeast MSH2 protein has been found to be associated with 
MSH3 or MSH6, and yeast MLHl has been found to be associated with PMSl . Human 
hMSH2 protein has been found to be associated with hMSH3 or hMSH6 (also designated 
GTBP or pi 60 by some authors), and human hMLHl has been found to be associated with 
hPMS2 (Li et al, 1995, Proc. Natl Acad., Sci. USA 92: 1950-1954; ProUa et al, 1994, Science 

15 265:1091-1093; Drummond et ah, 1995, Science 268:1909-1912; Marsischky et al., 1996, Gen. 
Dev. 10:407-420; Acharya et al, 1996, Proc. Natl. Acad. Sci. USA 93:13629-13634). 
Furthermore, MSH2/MSH3 and MSH2/MSH6 protein complexes appear to possess 
overlapping and redundant mismatch binding activities (Acharya et al., 1996, Proc. Natl. Acad. 
Sci. USA 93:13629-13634; Risinger et al, 1996, Nature Genet 14:102-105). 

20 [0009] Classification of MutS and MutL homologs is based on the presence in the proteins 
of highly conserved regions of amino acid identity. The most highly conserved region among 
MutS homologs includes approximately 150 amino acids which comprise a helix-tum-helix 
domain associated with a Walker A adenine-nucleotide and magnesium binding motif (Walker 
et aL, 1982, EMBO J. 1 :945-951). This adenine nucleotide binding domain constitutes more 

25 than 80% of the identifiable homology between MutS homologs (Fishel et al., 1997, Curr. 

Opin. Genet. Dev. 7:105-1 13). Both purified bacterial MutS homologs and purified yeast MutS 
homologs possess an intrinsic low-level ATPase activity (Haber et al., 1991, EMBO. J. 
10:2707-2715; Chi et al., 1994, J. Biol. Chem. 269: 29993-29997; Chi et al, 1994, J. Biol. 
Chem. 269:29984-29992; Alani et al, 1997, MoL Cell Biol. 1 7: 2436-2447). This ATPase 

30 activity is likely to be important for the function of MutS homologs, as indicated by the fact 
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that mutation of conserved amino acid residues in the adenine nucleotide binding domain 
results in a dominant mutator phenotype in both bacteria and yeast (Haber et al., 1991, EMBO. 
J. 10:2707-2715; Wu et al., 1994, J. Bacteriol 176:5393-5400; Alani et al, 1997, MoL Cell 
Biol. 1 7: 2436-2447). A central role for the adenine nucleotide binding domain is consistent 
5 with the ATP-dependent translocation model of mismatch repair proposed by Modrich and 
colleagues (Allen et al., 1997, EMBO J. 16:4467-4476). 

[0010] Genetic and biochemical studies of the human mismatch repair process indicate that 
it is similar to bacterial mismatch repair, except that the physiologically relevant mechanism for 
directing strand specificity is unknown (Miller et al., 1976, Proc. Natl. Acad. Sci. USA 
10 73:3073-3077; Glazer et al, 1987, Mol. Cell Biol, 7:218-224; Holmes et al, 1990, Proc. Natl. 
O Acad. Sci. USA 87:5837-5841; Thomas et al., 1991, J. Biol. Chem. 266:3744-3751; Fang et al, 

J 1993, J. Biol. Chem. 268:11838-11844; Longley et al., 1997, J. BioL Chem. 272: 10917- 

j?J 10921). Purified hMSH2 protein binds mismatched nucleotides and DNA lesions (Fishel et al., 

J 1994, Science 266:1403-1405; Fishel et al., 1994, Cancer Res. 54:5539-5542; Mello et al, 

15 1996, Chemistry & Biology 3:579-589), and the specificity and affinity of that recognition is 
ip enhanced by association of hMSH2 with hMSH3 or hMSH6 (Drummond et al., 1995; Acharya 

et al, 1996, Proc. Natl. Acad. Sci. USA 93:13629-13634; Palombo et al., 1996, Curr. Biol. 
W 6:1181-1184). 

[001 1] Although the ability of MutS homologs to bind to mismatched duplex DNA has 
20 been recognized (e.g. U.S. Patent No. 5,556,750), methods of using MutS homologs in vitro 
have been limited by a lack of understanding regarding the properties of such homologs. A 
need remains for methods of binding MutS homologs and mismatched duplex DNA, which 
methods take advantages of the biochemical properties of such homologs. 



25 Transgenic and NuUizygous Animals 

[0012] The development of transgenic animals and nuUizygous animal models has provided 
important new avenues for the study of specific gene functions in differentiation, 
embryogenesis and neoplastic development (Palmiter et al, 1986, Ann. Rev. Genet. 20:465- 
499). Transgenic animals frequently serve as model systems for the study of various disease 

30 states and also provide an experimental system in which to test compounds for their ability to 
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regulate disease. Nullizygous animals are similarly useful as experimental systems for the 
testing of compounds useful for diagnosis, treatment, or both, of disease. 
[0013] Lukkarinen et al. (1997, Stroke 28:639-645) teaches that gene constructs which 
enable the generation of transgenic mice also enable the generation of other transgenic rodents, 
5 including rats. Similarly, nuUizygous mutations in a genetic locus of an animal of one species 
can be replicated in an animal of another species having a genetic locus highly homologous to 
the first species. For example, many genetic loci are highly homologous among mammals, and 
even more highly homologous among subgroups of mammals, such as among rodents. 
[0014] The mutator hypothesis of tumorigenesis suggests that loss in an organism of a 
10 chromosomal stability function, a chromosomal maintenance function, or both, results in an 

0 elevated mutation rate in the organism. An elevated mutation rate hastens accumulation of the 
^1 numerous mutations required for multistep carcinogenesis (Loeb, 1991, Cancer Res, 51:3075- 

1 3079). 

'J [0015] Loss of the function of p53 protein has been proposed to increase cellular 

sQ 15 hypermutability in an organism, thereby accelerating tumorigenesis, although a clear role for 

p53 protein in genomic instability remains controversial (Kastan et al., 1992, Cell 71 :587-597; 
^ Fishel et al, 1997, Curr. Opin. Genet. Dev. 7:105-1 13). p53, the gene encoding p53 protein, is 

frequently mutated in a wide range of human cancers including, but not limited to, colonic 

M 

tumors (Fearon et al., 1990, Cell 61 :759-767). Transgenic mice nullizygous for p53 are viable 
20 and susceptible to tumorigenesis (de Wind et al., 1995, Cell 82:321-330; Reitmair et al, 1995, 
Nature Genet, 1 1 :64"70; Donehower et al., 1992, Nature 356:215-221; Jacks et al,, 1994, Curr. 
Biol. 4:1-7; Furdie et al., 1994, Oncogene 9:603-609). 

[0016] Although nullizygous p53 mice can be used as models of carcinogenesis, the rates at 
which such mice develop tumors can be slower than what is desirable, particularly for large- 
25 scale screening studies involving numerous potential anti-cancer therapeutic or prophylactic 
compositions. What is needed is a transgenic mouse which, when exposed to a carcinogen, 
succumbs to tumorigenesis caused by the carcinogen more readily than does a nullizygous p53 
mouse and which, even when not exposed to an identifiable carcinogen, succumbs to tumors 
more readily than does a nullizygous p53 mouse . 
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[0017] Critical unmet needs also exist for animal models of programmed cell death 
(apoptosis) and of aging, 

[0018] The present invention satisfies the needs identified above. 



5 BRIEF SUMMARY OF THE INVENTION 

[0019] The invention relates to a method of modifying a mismatched duplex DNA. The 
method comprises contacting an MSH dimer and the mismatched duplex DNA in the presence 
of a binding solution. In one embodiment, the binding solution comprising a nucleotide 
selected from the group consisting of ADP and ATP, and the concentration of ATP in the 
10 binding solution is less than about 3 micromolar The MSH dimer thereby associates with the 
13 mismatched region of the mismatched duplex DNA, and the mismatched duplex DNA is 

=,p modified. In one embodiment, the MSH dimer is selected from the group consisting of a 

' prokaryotic MSH homodimer, a prokaryotic MSH heterodimer, a eukaryotic MSH homodimer, 

and a eukaryotic MSH heterodimer. The MSH dimer may, for example, be a homodimer of a 
h§ 15 MutS homolog selected from the group consisting of a human MutS homolog, a murine MutS 

homology a rat MutS homolog, a Drosophila MutS homolog, a yeast MutS homolog, and a 
J Saccharomyces cerevisiae MutS homolog. An example of a eukaryotic MSH homodimer is an 
lU MSH2 homodimer. The eukaryotic MSH heterodimer usefiil in this method comprises MutS 

homologs independently selected from the group consisting of an MSH2 protein, an MSH3 
20 protein, an MSH4 protein, an MSH5 protein, and an MSH6 protein. By way of example, the 
MSH dimer may be selected from the group consisting of an MSH2:MSH3 heterodimer, an 
MSH2:MSH6 heterodimer, and an MSH4:MSH5 heterodimer. In another embodiment of this 
method, the prokaryotic MSH dimer is a homodimer of Escherichia coli MutS. Preferably, the 
MSH dimer is substantially purified. 
25 [0020] According to this method, the concentration of ATP in the binding solution is 
preferably less than about 0.3 micromolar, or, more preferably, the binding solution is 
substantially free of ATP. In another embodiment of this method, at least one of the MSH 
dimer and the mismatched duplex DNA is bound to a support. In yet another embodiment, the 
mismatched duplex DNA has at least one free end. In still another embodiment, the 
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mismatched duplex DNA comprises a DNA strand generated by reverse transcription of mRNA 
obtained from an organism. 

[0021] According to one aspect of this method, the mismatched duplex DNA comprises a 
first DNA strand having a reference nucleotide sequence and a second DNA strand. The second 

5 strand may, for example, be selected from the group consisting of a DNA strand obtained from 
an organism, a DNA strand obtained by amplification of at least a portion of a polynucleotide 
obtained from an organism, a DNA strand obtained by cleavage of a polynucleotide obtained 
from an organism, and a DNA strand obtained by reverse transcription of a polynucleotide 
obtained from an organism. The second DNA strand may also comprise at least a portion of a 

10 gene associated with a cancer in the organism. In one embodiment, the organism is a human 
and the gene is selected from the group consisting of an oncogene and a tumor suppressor gene. 
By way of example, such genes include abl, akt2, ape, bcl2alpha, bcl2beta, bcl3, bcr, brcal, 
brca2, cbl, ccndl, cdk4, crk-II, csflr/fms, dbl, dec, dpc4/smad4, e-cad, e2fl/rbap, egfr/erbb-1, 
elkl, elk3, eph, erg, etsl, ets2, fer, fgr/src2, flil/ergb2, fos, fps/fes, fral, fra2, fyn, hck, hek, 

15 her2/erbb-2/neu, her3/erbb-3, her4/erbb-4, hrasl , hst2, hstfl, ink4a, ink4b, int2/fgf3, jun, junb, 
jund, kip2, kit, kras2a, kras2b, Ick, lyn, mas, max, mcc, met, mlhl, mos, msh2, msh3, msh6, 
myb, myba, mybb, myc, mycU, mycn, nfl, nf2, nras, p53, pdgfb, piml, pmsl, pms2, ptc, pten, 
rafl, rbl, rel, ret, rosl, ski, srcl, tall, tgfbr2, thral, thrb, tiaml, trk, vav, vhl, wafl, wntl, wnt2, 
wtl, and yesl . Preferably, the cancer is hereditary non-polyposis colon cancer and the gene is 

20 selected from the group consisting of mlhl, msh2, msh3, msh6, pmsl, and pms2. Alternately, 
the cancer may be selected from the group consisting of a leukemia, a lymphoma, a 
meningioma, a mixed tumor of a salivary gland, an adenoma, a carcinoma, an adenocarcinoma, 
a sarcoma, a dysgerminoma, a retinoblastoma, a Wilms' tumor, a neuroblastoma, a melanoma, 
and a mesothelioma. 

25 [0022] In another aspect of this method, the mismatched duplex DNA and the MSH dimer 
are contacted in the presence of at least one non-mismatched duplex DNA. According to this 
aspect, the method may further comprise separating the MSH dimer from the non-mismatched 
duplex DNA after contacting the mismatched duplex DNA and the MSH dimer. In one 
embodiment, the method fiirther comprising dissociating the mismatched duplex DNA and the 

30 MSH dimer after separating the MSH dimer from the non-mismatched duplex DNA and 
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thereafter amplifying the mismatched duplex DNA. The MSH dimer may be bound to a 
support prior to separating the non-mismatched duplex DNA from the MSH dimer and the non- 
mismatched duplex DNA is separated from the MSH dimer in the presence of a separating 
solution which is substantially free of ATP. In one embodiment, this method further comprises 
5 releasing the mismatched duplex DNA from the MSH dimer after separating the non- 
mismatched duplex DNA from the MSH dimer. If the mismatched duplex DNA has at least 
one free end, it may be released from the MSH dimer by contacting the MSH dimer with a 

releasing solution. The releasing solution may, for example, be selected from the group 

9+ 

consisting of a solution comprising ATP and Mg^ ions, a solution comprising ATP and a 
10 magnesium-chelating agent, a solution comprising high salt, a solution comprising a gamma- 
modified ATP analog and Mg^"*" ions, and a solution comprising a gamma-hydrolysis-resistant 
9-1- 2+ 

ATP analog and Mg ions. Preferably, the releasing solution comprises ATP and Mg ions. 
If the mismatched duplex DNA does not have a free end, it may be released from the MSH 
dimer by contacting the MSH dimer with a releasing solution. This releasing solution may be 
15 selected from the group consisting of a solution comprising a magnesium-chelating agent, a 
solution comprising high salt, a solution comprising a double-stranded DNA cleaving enzyme, 



ATP and Mg^"^ ions, a solution comprising a double-stranded DNA cleaving enzyme, a 



gamma-modified ATP analog, and Mg^"^ ions, and a solution comprising a double-stranded 
DNA cleaving enzyme, a gamma-hydrolysis-resistant ATP analog, and Mg^^ ions. According 
20 to one embodiment, after contacting the mismatched DNA and the MSH dimer, the MSH dimer 
may be contacted with a MutL homolog. 

[0023] In another aspect of this method, association of the MSH dimer with the mismatched 
duplex DNA is detected after or while contacting the MSH dimer with the mismatched duplex 
DNA. Association of the MSH dimer with the mismatched duplex DNA may be detected, for 
25 example, using an assay selected from the group consisting of a gel mobility shift assay, a filter 
binding assay, an immunological assay, a sedimentation centrifiigation assay, a spectroscopic 
assay, an optical affinity assay, a DNA footprint assay, and a nucleolytic cleavage protection 
assay. 

[0024] In still another aspect of this method, the duplex DNA with which the MSH dimer is 
30 contacted does not have a free end. If the MSH dimer is present in molar excess with respect to 
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the mismatched duplex DNA, then an average of more than one the MSH dimer associates with 
one molecule of the mismatched duplex DNA. 

[0025] The invention also includes a method of modifying a mismatched duplex DNA 
which does not have a free end. This method comprising contacting the mismatched duplex 
5 DNA and an MSH dimer having ADP bound thereto in the presence of a binding solution. The 
concentration of ATP in the binding solution is less than about 3 micromolar, and the homolog 
associates with the mismatched region of the mismatched duplex DNA, thereby modifying the 
mismatched duplex DNA. 

[0026] The invention further includes a method of segregating a mismatched duplex DNA 
10 from a population of DNA molecules. The method comprises contacting an MSH dimer and 
13 the population in the presence of a binding solution and segregating the MSH dimer from the 

:Ji population. The binding solution comprises a nucleotide selected from the group consisting of 

'p ADP and ATP, an the concentration of ATP in the binding solution is less than about 3 

''0 micromolar. The MSH dimer associates with the duplex DNA in the presence of the binding 

^;|3 15 solution. When the MSH dimer is segregated from the population, the mismatched duplex 

DNA is also segregated from the population. 
3^ [0027] The invention still further includes a method of detecting a difference between a 

fU sample nucleotide sequence and a reference nucleotide sequence. According to this method, a 

first DNA strand and a second DNA strand are annealed to form a duplex DNA. The first DNA 
20 strand has the sample nucleotide sequence, and the second DNA strand has a nucleotide 

sequence which is complementary to the reference nucleotide sequence. If there is a difference 
between the sample nucleotide sequence and the reference nucleotide sequence, then the duplex 
DNA is a mismatched duplex DNA, The duplex DNA and an MSH dimer are contacted in the 
presence of a binding solution comprising a nucleotide selected from the group consisting of 
25 ADP and ATP. The concentration of ATP in the binding solution is less than about 3 

micromolar, and the MSH dimer associates with the duplex DNA if the duplex DNA is a 
mismatched duplex DNA. According to this method, it is then determined whether the MSH 
dimer is associated with the duplex DNA molecule. Association of the MSH dimer with the 
duplex DNA molecule is an indication that there is a difference between the sample nucleotide 
30 sequence and the reference nucleotide sequence. 
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[0028] In addition, the invention includes a kit for separating a mismatched duplex DNA 

from non-mismatched duplex DNAs. The kit comprises at least two MutS homologs^ a linker 

for binding the at least one of the MutS homologs to a support, and an additional reagent. The 

reagent may, for example, be selected from the group consisting of a nucleotide and a releasing 

5 solution, wherein the nucleotide is selected from the group consisting of ADP and ATP, and 

9_[_ 

wherein the releasing solution comprises Mg and a compound selected from the group 
consisting of ATP, a gamma-modified ATP analog, and a gamma-hydrolysis-resistant ATP 
analog. 

[0029] The invention also includes a method of determining whether a mammal is 
10 predisposed for carcinogenesis. This method comprises annealing a first DNA strand and a 
second DNA strand to form a duplex DNA. The first DNA strand has the nucleotide sequence 
of at least a portion of a gene selected from the group consisting of an oncogene and a tumor 
suppressor gene of the mammal. The second DNA strand has a nucleotide sequence which is 
complementary to the consensus nucleotide sequence of this region. If there is a sequence 
15 difference between the first DNA strand and the second DNA strand then the duplex DNA is a 
mismatched duplex DNA. The duplex DNA and an MSH dimer are contacted in the presence 
of a binding solution comprising a nucleotide selected from the group consisting of ADP and 
ATP. The concentration of ATP in the binding solution is less than about 3 micromolar, and 
the MSH dimer associates with the duplex DNA if the duplex DNA is a mismatched duplex 
20 DNA. According to this method, it is determined whether the MSH dimer is associated with 
the duplex DNA, whereby association of the MSH dimer with the duplex DNA is an indication 
that the mammal is predisposed for carcinogenesis. 

[0030] The invention fiirther includes a method of fractionating a population of duplex 
DNAs. This method comprises contacting the population with an MSH dimer in the presence 
25 of a binding solution comprising a nucleotide selected from the group consisting of ADP and 
ATP. The concentration of ATP in the binding solution is less than about 3 micromolar, and 
the MSH dimer associates with at least one mismatched duplex DNA in the population. The 
MSH dimer is segregated from the population of duplex DNAs, whereby the mismatched 
duplex DNA is also segregated from the population. The population is thereby fractionated. 
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[0031] The invention still further includes a method of selectively amplifying at least one 
mismatched duplex DNA of a population of duplex DNAs. This method comprises contacting 
the population with an MSH dimer in the presence of a binding solution comprising a 
nucleotide selected from the group consisting of ADP and ATP. The concentration of ATP in 
the binding solution is less than about 3 micromolar, and the MSH dimer associates with the 
mismatched duplex DNA. The MSH dimer is thereafter segregated from the population of 
duplex DNAs, whereby the mismatched duplex DNA is also segregated from the population of 
duplex DNAs. The mismatched duplex DNA is then amplified, whereby the mismatched 
duplex DNA is selectively amplified. 

[0032] The invention also includes a method of determining whether the nucleotide 
sequence of a first copy of a genomic sequence differs from the nucleotide sequence of a second 
copy of the genomic sequence. This method comprises amplifying a region of each of the first 
copy and the second copy of the genomic sequence to yield amplified first copies and amplified 
second copies. The amplified first copies and the amplified second copies are mixed and 
denatured to form a first mixture. The nucleic acids in the first mixture are then annealed to 
form a second mixture comprising duplex DNAs. If the nucleotide sequence of first copy and 
the nucleotide sequence of the second copy of the genomic sequence differ, then at least some 
of the duplex DNAs in the second mixture are mismatched duplex DNAs. The annealed second 
mixture is contacted with an MSH dimer in the presence of a binding solution comprising a 
nucleotide selected from the group consisting of ADP and ATP. The concentration of ATP in 
the binding solution is preferably less than about 3 micromolar, whereby the MSH dimer 
associates v^th mismatched duplex DNA. According to this method, it is then determined 
whether the MSH dimer is associated v^th at least some of the duplex DNAs. Association of 
the MSH dimer with at least some of the duplex DNAs is an indication that the nucleotide 
sequence of the first copy of the genomic sequence differs from the nucleotide sequence of the 
second copy of the genomic sequence. 

[0033] The invention further includes a composition for segregating a mismatched duplex 
DNA from a population of duplex DNAs. The composition comprises an MSH heterodimer 
bound to a support. 
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[0034] The invention still further includes a kit for screening a genomic region for a 
nucleotide sequence which differs from a reference nucleotide sequence. This kit comprises a 
pair of primers complementary to the ends of the region for amplifying the region, a DNA 
strand having the reference nucleotide sequence, and at least tv^o MutS homologs. 
5 [0035] The invention yet further relates to a nonhuman mammal which is nuUizygous for 
both Msh2 and p53. The mammal does not express Msh2 or p53 and exhibits a phenotype 
selected from the group consisting of inappropriate fetal apoptosis and a predisposition for 
carcinogenesis. 

[0036] The invention also relates to a method of making a nonhuman mammal which is 
10 nullizygous for both Msh2 and p53, does not express Msh2 or p53, and exhibits a phenotype 
0 selected from the group consisting of a predisposition for inappropriate fetal apoptosis and a 

^ predisposition for carcinogenesis. This method comprises mating 

P a) a first parent manmial which comprises at least one null allele of Msh2 and at least 

';|3 one null allele of p53 and 

15 b) a second parent mammal comprising at least one null allele of Msh2 and at least one 

null allele of p53. As a result of this mating, a non-human mammal is generated which is 
nullizygous for both Msh2 and p53, does not express Msh2 or p53, and exhibits a phenotype 
W selected from the group consisting of inappropriate fetal apoptosis and a predisposition for 

1^ carcinogenesis. 

20 [0037] The invention further relates to a method of determining whether a compound 

affects tumorigenesis in mammals. This method comprises administering the compound to a 
first nonhuman mammal which is nullizygous for both Msh2 and p53, does not express Msh2 
or p53, and exhibits a predisposition for carcinogenesis. Tumor incidence in the first nonhuman 
mammal is compared with tumor incidence in a second nonhuman mammal of the same type 
25 which is nullizygous for both Msh2 and p53, does not express Msh2 or p53, exhibits a 

predisposition for carcinogenesis, and to which the compound is not administered. A difference 
in tumor incidence in the first transgenic mammal compared with tumor incidence in the second 
transgenic mammal is an indication that the compound affects tumorigenesis in mammals. 
[0038] The invention still further relates to a method of determining whether a compound 
30 affects a biological phenomenon in mammals. The phenomenon may, for example, be selected 
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from the group consisting of apoptosis, aging, and fetal development. The method comprises 
administering the compound in utero to a first nonhuman mammalian embryo which is 
nuUizygous for both Msh2 and p53, does not express Msh2 or p53, and exhibits a 
predisposition for inappropriate fetal apoptosis. The development of the first nonhuman 
mammalian embryo is compared with the development of a second nonhuman mammalian 
embryo of the same type which is nuUizygous for both Msh2 and p53, does not express Msh2 
or p53, exhibits a predisposition for inappropriate fetal apoptosis, and to which the compound is 
not administered. A difference in the development of the first nonhiman mammalian embryo 
compared with the development of the second nonhuman mammalian embryo is an indication 
that the compound affects the biological phenomenon in mammals. 

[0039] The invention yet further relates to a cell line which is nuUizygous for both Msh2 
and p53, does not express Msh2 or p53, and exhibits a phenotype selected from the group 
consisting of a predisposition for carcinogenesis and a predisposition for apoptosis. The cell 
line is made by culturing a cell obtained from the nonhuman mammal described herein. 
[0040] The invention also relates to a method of determining whether a composition affects 
expression of a gene selected from the group consisting of the p53 gene and a gene encoding a 
MutS homolog. This method comprising administering the composition to a first non-human 
mammal which is nuUizygous for one of the p53 gene and the gene encoding a MutS homolog. 
A phenotype of the non-human mammal is compared with the phenotype of a second non- 
human mammal of the same type which is not nuUizygous for the one of the p53 gene and the 
gene encoding a MutS homolog, wherein the phenotype is selected from the group consisting of 
inappropriate fetal apoptosis and a predisposition for carcinogenesis. A difference between the 
phenotype of the first non-human mammal and the phenotype of the second non-human 
mammal is an indication that the composition affects expression of the other of the p53 gene 
and the gene encoding a MutS homolog. 

[0041] The invention further relates to a method of determining whether a composition 
affects expression of a gene selected from the group consisting of the p53 gene and a gene 
encoding a MutS homolog. This method comprises administering the composition to a first cell 
derived from a non-human mammal which is nuUizygous for one of the p53 gene and the gene 
encoding a MutS homolog. A phenotype of the first cell is compared with the phenotype of a 
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second cell derived from a non-human mammal of the same type which is not nuUizygous for 
the one of the p53 gene and the gene encoding a MutS homology wherein the phenotype is 
selected from the group consisting of inappropriate fetal apoptosis and a predisposition for 
carcinogenesis. A difference between the phenotype of the first cell and the phenotype of the 
5 second cell is an indication that the composition affects expression of the other of the p53 gene 
and the gene encoding a MutS homolog. 

[0042] The invention still further relates to a composition comprising a human MutS 
homolog fragment, wherein the fragment comprises a MutS homolog interaction region. 
[0043] The invention yet further relates to a method of inhibiting association of a first 
10 human MutS homolog and a second human MutS homolog. This method comprises contacting 
V at least one of the first human MutS homolog and the second human MutS homolog with a 

human MutS homolog fragment comprising a MutS homolog interaction region. Inhibition of 

ijj 

IS the first and the second human MutS homologs is thus inhibited. 

[0044] The invention also relates to a composition comprising substantially purified 

^13 15 hMSH5.^^ 

15 [0045] The invention further relates to a composition comprising an isolated nucleic acid 

§^ encoding hMSH5 . 

[0046] The invention still further includes an alternate method of modifying a mismatched 

h^^ duplex DNA. This method comprises contacting an MSH dimer and the mismatched duplex 

20 DNA in the presence of a binding solution comprising ADP, The concentration of ADP in the 
binding solution is at least about ten times the concentration of ATP, if ATP is present in the 
binding solution. The MSH dimer thereby associates with the mismatched region of the 
mismatched duplex DNA and modifies the mismatched duplex DNA. 



25 BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS 

[0047] Figure 1, comprising Figures lA, IB, IC, ID, IE, and IF, depict binding of 
hMSH2:hMSH6 heterodimer to mismatched and non-mismatched duplex DNA. Figure 1 A is 
an image of the results of a gel mobility shift assay performed using the G/T-mismatched 81- 
base pair duplex DNA substrate described herein. The concentrations of heterodimer used in 

30 the assay are indicated along the top of the image. The position of the S-shifted electrophoretic 
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band is indicated by "S". Figure IE is a graph which depicts the relationship between the 
concentration of heterodimer and the amount of product corresponding to the S-shifted 
electrophoretic band in Figure 1 A, as assessed using a phosphoimaging device. Figure IB is an 
image of the results of a gel mobility shift assay performed using the homologous 81 -base pair 
5 duplex DNA substrate described herein. The concentrations of heterodimer used in the assay 
are indicated along the top of the image. The position of the NS-shifted electrophoretic band is 
indicated by "NS". Figure IF is a graph which depicts the relationship between the 
concentration of heterodimer and the amount of product corresponding to the NS-shifted 
electrophoretic band in Figure IB, as assessed using a phosphoimaging device. Figure IC is an 
10 image which depicts the resuhs of a DNase footprint assay performed using the 81 -base pair 
C3 G/T-mismatched duplex DNA substrate described herein. The concentrations of 81 -base pair 

-JK are indicated along the top of the image. The position of the G residue of the G/T-mismatched 

substrate is indicated by "G", and the approximate region of the substrate protected from DNase 
cleavage by the heterodimer is indicated by a vertical line. Figure ID is an image which depicts 
v|3 15 the results of a DNase footprint assay performed using the homologous 81 -base pair duplex 
v/^H DNA substrate described herein. The concentrations of heterodimer used in the assay are 

indicated along the top of the image. The position of the G/C base pair corresponding to the 
IIJ G/T-mismatched base pair of the mismatched substrate is indicated by "G". 

l5 [0048] Figure 2, comprising Figures 2A, 2B, 2C, and 2D, depicts the results of gel mobility 

20 shift assays used to assess the ability of various adenine nucleotides to dissociate MSH dimer 
from the mismatch site, corresponding to the S-shifted electrophoretic band, such that the MSH 
dimer, corresponding to the NS-shifted electrophoretic band, exhibited DNA-associated 
diffusion. Figure 2A is an image of an assay in which the product corresponding to the S- 
shifted electrophoretic band was incubated in the presence of ATP at the concentration listed 
25 along the top of the image. Figure 2B is an image of an assay in which the product 

corresponding to the S-shifted electrophoretic band was incubated in the presence of adenosine- 
5'-0-3*-thiotriphosphate (ATP-gamma-S) at the concentration listed along the top of the image. 
Figure 2C is an image of an assay in which the product corresponding to the S-shifted 
electrophoretic band was incubated in the presence of ADP at the concentration listed along the 
30 top of the image. In Figures 2A, 2B, and 2C, "-" indicates that no heterodimer was included in 
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the assay mixture. Figure 2D is a graph which depicts quantitated results obtained using the 
resuhs depicted in Figures 2A, 2B, and 2C, as assessed using a phosphoimaging device. 
[0049] Figure 3 is a bar graph which depicts the effect of selected nucleotides, 
deoxynucleotides, and nucleotide analogs on G/T mismatch binding by the heterodimer, 
5 relative to the degree of binding observed in the absence of a (deoxy)nucleotide or analog. The 
effect of each indicated (deoxy)nucleotide or analog was assessed at 25 micromolar (left bar of 
each pair) and at 250 micromolar (right bar of each pair). 

[0050] Figure 4, comprising Figures 4A and 4B depicts the effects of ATP hydrolysis or 
ADP binding by the hMSH2/hMSH6 heterodimer on mismatched DNA binding. Figure 4A is a 

10 graph depicting the resuhs of gel mobility shift assays performed in the presence or absence of 
15 micromolar ATP and in the presence or absence of 15 micromolar ATP-gamma-S. 
Magnesium chloride was added at the time designated "0", and samples of the assay mixture 
were collected at the indicated times (in minutes). The binding reaction in each mixture was 
hahed by addition of 5 millimolar EDTA. Figure 4B is a graph depicting the resuhs of gel 

15 mobility shift assays performed in the presence of the indicated (in millimolar) concentrations 
of ATP or ADP or both. 

[0051] Figure 5 comprises Figures 5A and 5B. Figure 5A is a graph which depicts the 
results obtained in the assays described herein for detecting the rate of a single round of ATP 
hydrolysis by the complex. Figure 5B is a graph which depicts the results obtained in assays 
20 described herein for detecting the rate of a single round of ATP hydrolysis by the complex in 
the presence of selected amounts of mismatched DNA. 

[0052] Figure 6, comprising Figures 6A, 6B, 6C, and 6D, depicts the results of experiments 
performed to assess the effects of ATP, homologous DNA, or both, on the dissociation of the 
hMSH2:hMSH6 heterodimer from DNA, Figure 6 A is an image of the results obtained from 

25 gel mobility shift assays in which heterodimer-bound mismatched DNA was incubated with 

ATP for the time indicated in the image. Figure 6B is an image of the results obtained from gel 
mobility shift assays in which heterodimer-bound mismatched DNA was incubated with ATP 
and a 400-fold excess of homologous DNA for the time indicated in the image. Figure 6C is an 
image of the results obtained from gel mobility shift assays in which heterodimer-bound 

30 mismatched DNA was incubated with a 400-fold excess of homologous DNA for the time 
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indicated in the image. Figure 6D is an image of the results obtained from gel mobility shift 
assays in which the heterodimer was incubated with homoduplex DNA probe for fifteen 
minutes at 3TC (Lane A), the assay mixture was cooled to 4°C, and a 1,100-fold excess of 
unlabeled competitor homoduplex DNA was added (Lane B). In each of Figure 6A, 6B, 6C, 
5 and 6D, indicates assay mixtures which did not comprise the heterodimer. 

[0053] Figure 7 is a diagram which depicts the model of the liMSH2:hMSH6 heterodimer 
association with and dissociation fi-om mismatched duplex DNA described herein. The ADP- 
bound form of the heterodimer ("MSH2"), which is shown in the center of the diagram, is 
competent to bind mismatched duplex DNA, as shown at the bottom of the diagram, but cannot 
10 diffiise from the mismatch site on the DNA. Mismatched DNA-bound complex is enabled to 
O diffuse to a different position on the DNA by displacement of the ADP molecule boxxnd thereto 

3 by an ATP molecule (here indicated " ATP"), which yields the ATP-bound form of the 

% heterodimer. The ATP-bound form of the heterodimer is able to dissociate from a free end of 

the duplex DNA, but not from a blocked end of the duplex DNA. After dissociating from the 
^:fi 15 duplex DNA, the ATP-bound form of the heterodimer is converted to the ADP-bound form by 
f 1 hydrolysis of the heterodimer-bound ATP molecule, catalyzed by intrinsic ATPase activity of 

■^1 the heterodimer. 

ilf [0054] Figure 8 lists the nucleotide sequence of single nucleotide chains of some of the 39- 

and 81-base pair DNA substrates described herein (SEQ ID NOS: 2, 3, 5, and 6). 

20 [0055] Figure 9, comprising Figures 9A, 9B, 9C, and 9D, is a series of images, each of 
which depicts a whole mount view of an Msh2'^'p53"^" embryo at day 1 1 .5 of development. 
The embryo depicted in Figure 9A is a male Msh2"^"p53"^" mouse embryo, and exhibits 
phenotypically normal embryonic development, relative to mice having the same genotypic 
background. The embryos depicted in Figures 9B, 9C, and 9D are female Msh2"'^"p53"^" mouse 

25 embryos that are littermates of the male mouse depicted in Figure 9A. The female mouse 

embryos depicted in Figures 9B, 9C, and 9D exhibit developmental arrest having a phenotype 
corresponding to that expected at day 9.5 of embryonic development. 
[0056] Figure 10, comprising Panels A, B, C, D, E, and F, is a series of images, each of 
which depicts a paraffin embedded section obtained from an 1 1 .5 day old female mouse 

30 embryo. The images in Panels A, C, and E each depict a section obtained from an 1 1.5 day old 
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normal embryo. The images in Panels B, D, and F each depict a section obtained from an 11 .5 
day old Msh2"^'p53"^" mouse embryo. The sections depicted in Panels A and B are at lOOx 
magnification and are stained with hematoxylin and eosin. Magnification of the normal embryo 
is of the somite region of a sagittal section. The sections depicted in Panels C and D are at 
5 lOOx magnification and are chromogenically-TUNEL stained. The sections depicted Panels E 
and F are at 40x magnification and are fluorescently-TUNEL stained. Cells undergoing 
apoptosis in normal female embryos were rare; chromogenically- and fluorescently-TUNEL 
stained cells depicted in Panels C and E represent circumscribed apoptotic foci normally found 
in developing mouse embryos. 
10 [0057] Figure 1 1 is a graph which depicts Kaplan-Meier survival probabilities of Msh2"^", 
O p53"^", and Msh2"^"p53"^" mice. 

[0058] Figure 12 is a diagram which indicates the primary structure of S-labeled 

% IVTT-hMSH3 polypeptides used to identify approximate boundaries of hMSH2-interaction 

, .{^■- 

regions of hMSH3. "Amino Acid Number" refers to the amino acid residues of hMSH3 which 
15 the corresponding IVTT-hMSH3 polypeptide comprised. The rectangular entities in the central 
part of the figure represent relative positions of the amino acid residues which the 
corresponding IVTT-hMSH3 polypeptide comprised with respect to full length hMSH3, which 
is represented by polypeptide 1). The symbol, A, indicates a deleted region of a polypeptide. 
The shaded regions of polypeptide 1) represent the hMSH2-interaction regions of hMSH3. 
20 "Interaction with hMSH2" indicates whether or not the corresponding polypeptide interacted 
with GST-hMSH2. 

[0059] Figure 13 is a diagram which indicates the primary structure of S-labeled 
IVTT-hMSH2 polypeptides used to identify approximate boundaries of hMSH3 -interaction 
regions of hMSH2. "Amino Acid Number" refers to the amino acid residues of hMSH2 which 
25 the corresponding IVTT-hMSH2 polypeptide comprised. The rectangular entities in the central 
part of the figure represent relative positions of the amino acid residues which the 
corresponding IVTT-hMSH2 polypeptide comprised with respect to full length hMSH2, which 
is represented by polypeptide 1). The shaded regions of polypeptide 1) represent the hMSH3- 
interaction regions of hMSH2. "Interaction with hMSH3" indicates whether or not the 
30 corresponding polypeptide interacted with GST-hMSH3. 



m 
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[0060] Figure 14 is a diagram which indicates the primary structure of S-labeled 
IVTT-hMSH2 polypeptides used to identify the linear orientation of the hMSH3-interaction 
regions of hMSH2. "Amino Acid Number" refers to the amino acid residues of hMSH2 which 
were present in the corresponding IVTT-hMSH2 polypeptide. The rectangular entities in the 
5 central part of the figure represent relative positions of the amino acid residues which were 
present in the corresponding IVTT-hMSH2 polypeptide with respect to full length hMSH3, 
which is represented by polypeptide 1), The symbol, A, indicates a deleted region of a 
polypeptide. "Interaction with specific hMSH3 domains" indicates whether or not the 
corresponding polypeptide interacted with a GST-hMSH3 fusion protein comprising the amino- 
10 terminal ("^4"^") interaction region of hMSH3 or with a GST-hMSH3 fusion protein 
G comprising the carboxy-terminal ("COO"") interaction region of hMSH3 . 

^jg [0061] Figure 15 is a diagram which indicates the primary structure of S-labeled 

IVTT-hMSH6 polypeptides used to identify approximate boundaries of hMSH2-interaction 
jfi regions of hMSH6. "Amino Acid Number" refers to the amino acid residues of hMSH6 which 

^i|3 15 were present in the corresponding IVTT-hMSH2 polypeptide. The rectangular entities in the 

central part of the figure represent relative positions of the amino acid residues which were 
3^ present in the corresponding IVTT-hMSH6 polypeptide with respect to full length hMSH6, 

^ili which is represented by polypeptide 1). The shaded regions of polypeptide 1) represent the 

hMSH2-interaction regions of hMSH6. "Interaction with hMSH2" indicates whether or not the 
20 corresponding polypeptide interacted with GST-hMSH2. 

[0062] Figure 16 is a diagram which indicates the primary structure of S-labeled 
IVTT-hMSH2 polypeptides used to identify approximate boundaries of hMSH6-interaction 
regions of hMSH2, "Amino Acid Number" refers to the amino acid residues of hMSH2 which 
were present in the corresponding IVTT-hMSH2 polypeptide. The rectangular entities in the 
25 central part of the figure represent relative positions of the amino acid residues which the 

corresponding IVTT-hMSH2 polypeptide comprised with respect to full length hMSH2, which 
is represented by polypeptide 1). The shaded regions of polypeptide 1) represent the hMSH6- 
interaction regions of hMSH2. "Interaction with hMSH3" indicates whether or not the 
corresponding polypeptide interacted with GST-hMSH6. 
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[0063] Figure 17 is a diagram which indicates the primary structure of S-labeled 
IVTT-hMSH2 polypeptides used to identify the Hnear orientation of the hMSH6-interaction 
regions of hMSH2. "Amino Acid Number" refers to the amino acid residues of hMSH2 which 
were present in the corresponding IVTT-hMSH2 polypeptide. The rectangular entities in the 
5 central part of the figure represent relative positions of the amino acid residues which were 
present in the corresponding IVTT-hMSH2 polypeptide with respect to full length hMSH6, 
which is represented by polypeptide 1). The symbol, A, indicates a deleted region of a 
polypeptide. "Interaction with specific hMSH6 domains" indicates whether or not the 
corresponding polypeptide interacted with a GST-hMSH6 fusion protein comprising the amino- 
10 terminal ("NH4"^") interaction region of hMSH6 or with a GST-hMSH6 fusion protein 
O comprising the carboxy-terminal ("COO"") interaction region of hMSH6. 

:15 [0064] Figure 1 8 is a diagram which illustrates a model of liMSH2 consensus interaction 

> with hMSH3 or hMSH6. The interaction regions of hMSH2, hMSH3, and hMSH6 are 

indicated in gray and are connected with lines that illustrate the specificity of each region to its 
h|3 15 corresponding interaction partner region. The nucleotide binding regions of hMSH2, hMSH3, 
and hMSH6 are indicated as black boxes. The location of HNPCC-associated mutations tested 
in these studies are illustrated as black diamonds. 
W [0065] Figure 19, comprising Figures 19A, 19B, and 19C, Usts the nucleotide sequence of 

Ip cDNA encoding hMSH5 (SEQ ID NO: 30) and the putative amino acid sequence of hMSH5 

20 (SEQ ID NO: 29). 

DETAILED DESCRIPTION OF THE INVENTION 
[0066] The invention relates to a method of binding one or more MutS homolog (MSH) 
dimers to a mismatched duplex DNA. The invention also relates to methods of using adenine 

25 nucleotides to modulate recognition of mismatched duplex DNA and to modulate DNA- 

associated diffusion of MSH dimers after binding of such dimers to mismatched duplex DNA. 
The invention further relates to a method of binding a complex comprising a MutL homolog 
and a MutS homolog to mismatched duplex DNA. The MutL homolog interacts with the MutS 
homolog and influences the ability of the MutS homolog to bind with a mismatched region of 

30 the duplex DNA. 
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A Summary of Some of the Novel Properties of 
MutS Homologs and MutL Homologs 

[0067] The compositions, kits, and methods of the invention may be better understood by 
5 understanding the novel properties of MutS homologs and MutL homologs v^hich have been 
discovered by the inventors. This section presents merely a brief introduction to several these 
properties. It is understood that the operability of the compositions, kits, and methods of the 
invention does not depend upon the correctness of the information provided in this section. 
[0068] An important aspect of the invention is the discovery that MutS homolog (MSH) 

10 dimers and, in some organisms, MSH heterodimers, associate with mismatched regions of a 
mismatched duplex DNA. Binding of a MutS dimer to mismatched DNA occurs when ADP, 
but not ATP, is bound to the MSH dimer. The MSH dimer may, for example, be in the form of 
an MSH homodimer (e.g. an E. coli MutS dimer) or an MSH heterodimer (e.g. a himian MSH 
heterodimer such as an hMSH2:hMSH3 dimer, an hMSH2:hMSH6 dimer, or an 

15 hMSH4:hMSH5 dimer). This association may be effected either in vitro or in vivo. 

[0069] ADP-bound MSH dimer associated with a mismatched region of a mismatched 
duplex DNA does not move along the duplex DNA, but instead remains located at the 
mismatched region. Exchange of ATP for the ADP bound to the MSH dimer confers to the 
MSH dimer DNA-associated diffusibility, which means that the MSH dimer becomes able to 

20 move from the site of the mismatched region of the duplex DNA to another site on the same 
duplex DNA. If the mismatched duplex DNA has a free end, then the DNA-associated 
diffusibility of an ATP-bound MSH dimer enables the dimer to the duplex DNA dissociate 
from the duplex DNA. If the mismatched duplex DNA does not have a free end (e.g. the DNA 
is circular or has bulky moieties such as proteins bound to the ends thereof), then neither the 

25 ADP-bound form or the ATP-bound form of the MSH dimer is able to dissociate from the 
duplex DNA. 

[0070] Because MSH heterodimers, in their ATP-bound form, exhibit DNA-associated 
diffusibility with regard to the duplex DNA with which they are associated, an ATP-bound 
MSH dimer will not necessarily be associated with the mismatched region of a mismatched 
30 duplex DNA, but instead may have diffused away from the mismatched region to 
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complementary region of the same mismatched duplex DNA. Thus, a mismatched duplex 
DNA having one or more ATP-bound MSH dimers associated therewith is able to associate 
with another MSH dimer in an ADP-bound form. Therefore, numerous MSH dimers may be 
associated with a mismatched duplex DNA by contacting the DNA with ADP-bound MSH 
5 dimers in the presence of a binding solution which comprises ATP. It is understood that certain 
MSH homodimers (e.g. hMSH2 dimers; Fishel et al, 1994, Science 266:1403-1405) exhibit 
little or no alteration in activity associated with adenine nucleotide binding, and may be useful 
for these properties. For example, hMSH2 binds a variety of mismatched nucleotides but 
remains unperturbed in the presence of either ADP or ATP (Fishel et al, 1994, Science 
10 266:1403-1405). 

0 [0071] MSH dimers exhibit an intrinsic ATP hydrolytic activity, and this hydrolytic activity 
is greatly enhanced in their non-DNA-associated form, but not in their DNA-associated form. 

'j: Thus, an ATP-bound MSH dimer associated with DNA remains ATP-bound. However, ATP 

■ 0 bound to an MSH dimer is rapidly converted to ADP if the dimer is not associated with DNA. 

:;;>,:;; 

v|j 15 Thus, the intrinsic ATPase activity exhibited by MSH dimers catalyzes the transformation of an 

ATP-bound dimer (which cannot associate with a mismatched region of DNA) to an ADP- 
■f; bound dimer (which can associate with a mismatched region of DNA). In addition, the 

1 ll mismatched DNA-associated form of MSH dimers are able to more rapidly exchange ATP in 

place of ADP bound to the dimer than MSH dimers not associated with DNA or associated with 

20 non-mismatched DNA, 

[0072] Without wishing to be bound by any particular theory of operation, binding of MSH 
dimers to mismatched duplex DNA may be visualized as illustrated in Figure 7. An ADP- 
bound MSH dimer associates with the mismatched region of the DNA. Exchange of ATP in 
place of the ADP bound to the MSH dimer enables the dimer to diffuse to a different position 

25 on the DNA. The DNA-associated ATP-bound MSH dimer cannot dissociate from a blocked 
end of the DNA in the presence of Mg , but can dissociate from a free end of the DNA. 
Alternately, ATP-bound MSH dimer can be dissociated from DNA which does not have a free 
end in the presence of EDTA or a high salt concentration. ATP-bound MSH dimer not 
associated with DNA is able to hydrolyze the ATP moiety, yielding an ADP-bound MSH 

30 dimer, which is then able to associate with a mismatched region of DNA, 
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[0073] An MSH dimer may be thought of as 'molecular switch; wherein the ADP-bound 
dimer represents an 'ON' state, and wherein the ATP-bound dimer represents an 'OFF' state. In 
the 'ON' state, the dimer is able to associate with a mismatched region of DNA but is not able to 
diffuse to a different position on the DNA with which it is associated. In the 'OFF' state, the 
5 dimer is not able to associate with a mismatched region of DNA but is able to diffuse to a 

different position on the DNA with which it is associated. Recalling the involvement of MutS 
homologs in DNA mismatch repair and, as demonstrated herein, in control of the cell 
replication cycle, it is understood that compounds which modulate the transition of MSH 
dimers from the 'ON' to the 'OFF' state or vice versa may be used to modulate DNA mismatch 
10 repair, timing of and progression through the cell replication cycle, and/or the physiological 

0 process(es) associated with either DNA mismatch repair or the cell replication cycle. 

[0074] A MutL homolog improve the intrinsic ATPase activity exhibited by a MSH dimer 
when the MutL homolog associates with the MSH dimer. MutL homologs may thus be 
analogized to GTPase accelerating proteins (sometimes designated "GAP proteins") which have 
15 been described in the context of G protein activity. Without wishing to be bound by any 

particular theory, it is thought that association of a MutL homolog with a MSH dimer increases 
Iff. the rate of dissociation of the ATP-bound MSH dimer from duplex DNA and increases the rate 

1 U at which ATP is converted to ADP by the non-duplex DNA-associated ATP-bound MSH 
1^ dimer, thereby rendering the MSH dimer able to bind to a mismatched duplex DNA more 

20 rapidly than in the absence of the MutL homolog. 

[0075] The biochemical properties of MutS homologs and MutL homologs described in this 
section are used advantageously in the compositions, kits, and methods of the invention. 

Definitions 

25 [0076] As used herein, each of the following terms has the meaning associated with it in 
this section. 

[0077] A "MutS homolog" is a protein which comprises a region which exhibits significant 
sequence similarity with at least one of the following regions of the human MSH2 protein 
(wherein the regions are indicated by the numbers of the amino acid residues of MSH2 which, 
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inclusively, bound the region; the corresponding amino acid sequences of hMSH2 is indicated 
thereafter in parentheses): 

Region I: hMSH2 amino acid residues 37-57 (LFDRGDFYTA HGEDALLAAR E; 
SEQ ID NO: 24); 

5 Region II: hMSH2 amino acid residues 336-368 (TPQGQRLVNQ WIKQPLMDKN 

RIEERLNLVE AFV; SEQ ID NO: 25); 
Region III: hMSH2 amino acid residues 63 5-662 (LKASRHACVE VQDEIAFIPN 

DVYFEKDK; SEQ ID NO: 26); 
Region IV: hMSH2 amino acid residues 667-770 (IITGPNMGGK STYIRQTGVI 
10 VLMAQIGCFV PCESAEVSIV DCILARVGAG DSQLKGVSTF 

13 MAEMLETASI LRSATKDSLI IIDELGRGTS TYDGFGLAWA ISEY; SEQ 

J ID NO: 27); and 

> Region V: hMSH2 amino acid residues 812-852 (LTMLYQVKKG YCDQSFGIHV 

AELANFPKHV lECAKQKALE L; SEQ ID NO: 28). 

'0 15 The amino acid sequence of hMSH2 has been described (e.g. Fishel et al, 1993, Cell 75:1027), 
r^'i Preferably, the MutS homolog of the invention comprises a region which exhibits significant 

jl^l sequence similarity with Region IV, and more preferably with both Region IV and Region V. It 

is also preferred that the MutS homolog comprises a plurality of regions, each of which exhibits 
1^^ significant sequence similarity with one of Regions I-V of hMSH2, and more preferred that the 

20 MutS homolog comprises regions which independently exhibit significant sequence similarity 
with each of Regions I-V of hMSH2. Thus, MutS homologs which are included in the invention 
include, but are not limited to Aquifex aeolicus MutS, Aquifex aeolicus MSH, Aquifiex 
pyrophilicus MutS, Arabidopsis thaliana MSH2, Arabidopsis thaliana MSH6, Azotobacter 
vinelandii MutS, Bacillus subtilis MutS, Bacillus subtilis MSH, Caenorhabdis elegans MSH4, 
25 Caenorhabdis elegans MSH5, Drosophila melanogaster MSH2, Escherichia coli MutS, Homo 
sapiens MSH2, Homo sapiens MSH3, Homo sapiens MSH4, Homo sapiens MSH5, Homo 
sapiens MSH6, Haemophilus influenzae type B MutS, Helicobacter pylori MSH, Mus musculus 
MSH2, Mus musculus MSH3, Mus musculus MSH6, Neurospora crassa MSH2, Rattus 
norvegicus MSH2, Saccharomyces cerevisiae MSHl, Saccharomyces cerevisiae MSH2, 
30 Saccharomyces cerevisiae MSH3, Saccharomyces cerevisiae MSH4, Saccharomyces cerevisiae 
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in 



MSH5, Saccharomyces cerevisiae MSH6, Saccharomyces pombe MSHl, Saccharomyces 
pombe MSH2, Saccharomyces pombe Swi4, Saccharomyces pombe MutS, Salmonella 
typhimurium MutS, Synechocystis sp. MutS, Synechocystis sp. MSH, Thermus aquaticus 
MutS, Thermotoga maritima MutS, and Thermus thermophilus MutS, each of which proteins is 
5 described either herein or in the prior art. 

[0078] A "MutL homolog" is a protein which exhibits significant similarity to the MutL 
protein of E. coU. MutL homologs include, but are not limited to, eukaryotic MLHl, MLH2, 
PMSl, and PMS2 proteins. 

[0079] A protein or a region of a protein exhibits "significant similarity" to another protein 
10 or a region of another protein if, when the two proteins or regions are compared in a selected 
alignment, at least 50%, at least 70%, at least 85%, at least 95%, or at least 99% of the aUgned 
amino acid residues of the two proteins or the two regions are either identical or similar. 
Similar amino acid residues are indicated by the groups listed on the following lines: 
glycine, alanine; 



15 valine, isoleucine, leucine; 



aspartic acid, glutamic acid; 

asparagine, glutamine; 

serine, threonine; 

lysine, arginine; and 
20 phenylalanine, tyrosine. 

[0080] A "heterodimer" is a protein which comprises more than one subunit, wherein at 
least one subunit has an amino acid sequences which is different from the amino acid sequence 
of another subunit of the same protein. Heterodimers having an A' protein subunit and a 'B' 
protein subunit are herein designated "A:B heterodimers". 
25 [0081] A "DNA strand" is a single polydeoxyribonucleotide. 

[0082] A "duplex DNA" is a molecule that comprises at least one polydeoxyribonucleotide, 
wherein at least a portion of the polydeoxyribonucleotide has a double-stranded, hydrogen 
bonded conformation. 

[0083] A "mismatched" duplex DNA is a duplex DNA wherein at least one DNA strand 
30 comprises a region which has at least one nucleotide residue that is not base-paired with a 
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complementary nucleotide residue and which is flanked by regions wherein at least about ten 
nucleotide residues are all base-paired with complementary nucleotide residues. 
[0084] A first region of an DNA "flanks" a second region of the DNA if the two regions are 
adjacent one another or if the two regions are separated by no more than about 10 nucleotide 
residues, and preferably no more than 1 nucleotide residue. 

[0085] A "non-mismatched" duplex DNA is a duplex DNA wherein all nucleotide 
residues of the double-stranded portion thereof are base-paired with complementary nucleotide 
residues. 

[0086] "Complementary" refers to the broad concept of sequence complementarity between 
regions of two nucleic acid strands or between two regions of the same nucleic acid strand. It is 
known that an adenine residue of a first nucleic acid region is capable of forming specific 
hydrogen bonds ("base pairing") with a residue of a second nucleic acid region which is 
antiparallel to the first region if the residue is thymine or uracil. Similarly, it is known that a 
cytosine residue of a first nucleic acid strand is capable of base pairing with a residue of a 
second nucleic acid strand which is antiparallel to the first strand if the residue is guanine. A 
first region of a nucleic acid is complementary to a second region of the same or a different 
nucleic acid if, when the two regions are arranged in an antiparallel fashion, at least one 
nucleotide residue of the first region is capable of base pairing with a residue of the second 
region. Preferably, the first region comprises a first portion and the second region comprises a 
second portion, whereby, when the first and second portions are arranged in an antiparallel 
fashion, at least about 50%, and preferably at least about 75%, at least about 90%, or at least 
about 95% of the nucleotide residues of the first portion are capable of base pairing with 
nucleotide residues in the second portion. More preferably, all nucleotide residues of the first 
portion are capable of base pairing with nucleotide residues in the second portion, 
[0087] A chemical entity such as a molecule is "bound" to another chemical entity if at least 
one portion of each of the two chemical entities are covalently or non-covalently bonded to one 
another in an essentially fixed position. By way of example, as described herein, an ADP- 
bound form of an MSH dimer is bound to a mismatched region of a duplex DNA because the 
MSH dimer predominantly associates with the duplex DNA at the location of the mismatch. 
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[0088] A chemical entity such as a molecule is "associated" with another chemical entity if 
at least one of the chemical entities can change its position relative to the other without 
becoming dissociated therefrom. By way of example, as described herein, an ATP-bound form 
of an MSH dimer is associated with a mismatched duplex DNA because the MSH dimer can 
5 diffuse to a different position on the DNA without dissociating therefrom. 

[0089] A duplex DNA is "modified" if a chemical entity such as a molecule is bound to, 
associated with, or dissociated from the duplex DNA, or if the duplex DNA is segregated from 
a population of DNA molecules. 

[0090] A duplex DNA has a "free end" if the duplex DNA is not circular and if both ends of 
10 the duplex DNA are not blocked. 
O [0091] An end of a duplex DNA is "blocked" if a bulky moiety is bound to a portion of the 

: p duplex DNA between a reference point on the duplex DNA and the end of the duplex DNA. 

^ tS [0092] A "bulky moiety" bound to a portion of a duplex DNA is any chemical entity which 

has a size sufficient to prevent sliding of an ATP-bound MSH dimer along the DNA duplex 
=ig 15 from a location on one side of the bulky moiety to a location on the other side of the bulky 
fn moiety. Examples of bulky moieties include proteins, metallic, glass, or polymeric surfaces, 

and the like. 

iU [0093] A "gamma-modified ATP analog" is an ATP molecule which has an a group 

hi^ attached to the gamma phosphodiester moiety thereof^ whereby the beta-gamma phosphodiester 

20 linkage is cleaved by an MSH dimer with an efficiency less than 25% of the efficiency with 

which ATP is hydrolyzed by the MSH dimer. By way of example, ATP-gamma-S is a gamma- 
modified ATP analog. 

[0094] A "gamma-hydrolysis-resistant ATP analog" is an ATP molecule which has an 
altered beta-gamma phosphodiester linkage chemistry whereby the altered beta-gamma 

25 phosphodiester linkage cannot be cleaved be either the intrinsic ATP hydrolytic activity of an 
MSH dimer or by the ATP hydrolytic activity of an MSH dimer-MutL homolog complex. 
Examples of gamma-hydrolysis-resistant ATP analogs include, but are not limited to ATP-PNP 
and ATP-PCP, which are compounds well known and described in the art. 
[0095] A solution is "substantially free" of ATP when the concentration of ATP is very low 

30 (e.g. less than 30 nanomolar, and preferably less than 1 nanomolar). 
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[0096] The term "substantially pure" describes a compound, e.g., a protein or polypeptide 
which has been separated from components which naturally accompany it. Typically, a 
compound is substantially pure when at least 10%, more preferably at least 20%, more 
preferably at least 50%, more preferably at least 60%, more preferably at least 75%, more 
5 preferably at least 90%, and most preferably at least 99% of the total material (by volume, by 
wet or dry weight, or by mole percent or mole fraction) in a sample is the compound of interest. 
Purity can be measured by any appropriate method, e.g., in the case of polypeptides by column 
chromatography, gel electrophoresis or HPLC analysis, A compound, e.g., a protein, is also 
substantially purified when it is essentially free of naturally associated components or when it is 
10 separated from the native contaminants which accompany it in its natural state. 

[0097] "NuUizygous" refers to an animal which possesses a pair of null mutant alleles at a 
given genetic locus. Hence, a nuUizygous Xxx mouse (wherein Xxx is any gene normally 
present in a mouse) does not possess a functional Xxx gene, whereas a wild-type mouse may 
55 possess one or two functional copies of the Xxx gene. To illustrate the notation used herein, the 

'iQ 15 term "nuUizygous Xxx mouse" is synonymous with the term "Xxx" " mouse." Similarly, a 
"heterozygous Xxx mouse" has one functional Xxx allele and one non-fiinctional Xxx allele, 
and is synonymous with the term "Xxx mouse." A "wild type mouse" has at least one copy, 
U and possibly two copies, of a fimctional Xxx allele, and is synonymous with the term "Xxx 

mouse." A "homologous wild type mouse' has two copies of a functional Xxx allele, and is 
20 synonymous with the term "Xxx"^^"^ mouse." 

[0098] As used herein, an "instructional material" includes a publication, a recording, a 
diagram, or any other medium of expression which can be used to communicate the usefulness 
of the compositions and methods of the invention for associating a MSH dimer with a 
mismatched duplex DNA. The instructional material of the kit of the invention may, for 
25 example, be affixed to a container which contains the dimer or be shipped together with a 

container which contains the dimer. Alternatively, the instructional material may be shipped 
separately from the container with the intention that the instructional material and the dimer be 
used cooperatively by the recipient. 

[0099] A solution comprises "high salt" if the concentration of one or more salts in the 
30 solution is, cumulatively, at least about 1 molar, preferably at least about 3 molar. 



-28- 



[0100] A "double-stranded DNA-cleaving enzyme" is an enzyme which catalyzes 
hydrolysis of both strands of a duplex DNA, leaving either blunt or staggered ends. Examples 
of double-stranded DNA-cleaving enzymes include, but are not limited to, restriction 
endonucleases. 

5 

Description 

[0101] The invention relates to a method of modifying a mismatched duplex DNA. The 
method comprises contacting a MutS homolog (MSH) dimer and the mismatched duplex DNA 
in the presence of a binding solution. The binding solution comprises either ADP and ATP, 
10 and the concentration of ATP in the binding solution is less than about 3 micromolar, preferably 
O less than about 0.3 micromolar, and more preferably v^erein the binding solution is 

^ substantially free of ATP. Ahemately, ADP is used in the absence of ATP, or at least in excess 

with respect to ATP (i.e. ADP at a 2-ft>ld, 10-fbld, or lOO-fold or greater excess relative to 
''0 ATP). The MSH dimer thereby binds ADP. When the ADP-boxxnd MSH dimer is contacted 

15 with the mismatched duplex DNA, the dimer associates v^th the mismatched region of the 
DNA, thus forming a modified mismatched duplex DNA. 
■^^ [0102] The MSH dimer may be a homodimer or a heterodimer of any MiatS homolog which 

tU is presently known to comprise or is discovered to comprise one or more which exhibits 

|i; significant sequence similarity with at least one of Region I-V of human MSH2 (hMSH2), as 

20 described herein. The MutS homolog may be a prokaryotic MutS homolog or a eukaryotic 

MutS homolog. Preferably, the MutS homolog is a heterodimer, more preferably a heterodimer 
comprising MutS homologs obtained from a single species of organism. Thus, by way of 
example, the MSH dimer useful in the methods, kits, and compositions of the invention may be 
the E. coli MutS protein, an hMSH2 homodimer, a heterodimer comprising hMSH2 and either 
25 hMSH3 or hMSH6, a heterodimer comprising hMSH4 and hMSH5, a yeast MSH2 protein 
homodimer, a heterodimer comprising yeast MSH2 and either yeast MSH3 or yeast MSH6, a 
homodimer of a rat MSH2 (e.g. GenBank accession number X93591), a dimer of a Xenopus 
homolog of hMSH2 (Varlet et al, 1994, Nucl. Acids Res. 22:5723-5728), a homodimer of 
Drosophila MSH2 (e.g. GenBank accession number U 17893), a homodimer of murine MSH2 
30 (e.g. GenBank accession number X93591, Varlet et al, 1994, Nucl. Acids Res. 22:5723-5728), 
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a heterodimer comprising murine MSH2 and either murine MSH3 (e.g. Rep-3; Linton et aL, 
1989, MoL Cell. Biol. 9:3058-3072; Smith et al, 1990, MoL Cell. BioL 10:6003-6012) or 
murine MSH6 (e.g. Gen Bank accession number U42190), and the like. The MutS homolog of 
the MSH dimer used in the compositions, kits, and methods of the invention may also be any of 
5 the 41 MutS homologs and presently listed in the NCBI database. It is understood that, given 
the high degree of similarity among mammalian MutS homologs (Fishel et al, 1997, Curr. Op. 
Genet. Develop. 7:105-113), a dimer of any mammalian hMSH2 homolog can be used in the 
methods of the invention. 

[0103] The mismatched duplex DNA molecule useful in the methods of the invention may 
10 be any duplex DNA molecule having at least one mismatched region. By way of example, the 
O DNA molecule may be a linear DNA molecule, a circularized DNA molecule such as a plasmid 

^ or a viral genome, a chromosome, a cDNA generated by reverse transcription of an RNA 

molecule, a PGR primer, a PGR product, a complex formed between a single-stranded DNA 
^^0 probe and another single-stranded DNA molecule, and the like. The mismatched region may be 

M 15 any region of a duplex DNA molecule in which the two DNA strands of the molecule are not 
'!m completely complementary. By way of example, the mismatched region may comprise one or 

■jp more pairs of mismatched nucleotides in an otherwise complementary region of a duplex DNA 

lU molecule, a region of a duplex DNA molecule wherein a thymine dimer exists on one DNA 

strand of the molecule, a region of a duplex DNA molecule comprising a nucleotide which has 
20 been covalently modified by an agent capable of reacting with a nucleotide, such as cisplatin, a 

region of a duplex DNA molecule which comprises an alkyl-O-6-methyl guanine residue, a 

region of a duplex DNA molecule which comprises a single stranded loop of one or more 

nucleotides, a region of a duplex DNA molecule which comprises a pyrimidine dimer, and the 

like. 

25 [0104] While any amount of ADP can be used in the binding solution of the method of the 
invention, it is preferred that the homolog be contacted with the mismatched duplex DNA in the 
presence of a binding solution comprising at least about 100 nanomolar ADP, preferably at 
least about 6 micromolar ADP, and more preferably at least about 60 micromolar ADP. As 
described with greater particularity in Example 1, ATP displaces ADP from the MSH dimer 

30 when the dimer is associated with a mismatched region of duplex DNA. Thus, it is important 
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either that the concentration of ATP in the solution be minimized, for example by maintaining 
the concentration of ATP lower than about 3 micromolar, preferably lower than about 0.3 
micromolar, and more preferably lower than about 10 nanomolar, or that the ratio of the 
concentration of ADP in the solution to the concentration of ATP in the solution be greater than 
5 a minimum value, such as about two, and preferably greater than about eight, and even more 
preferably greater than about sixteen. Preferably, the solution is substantially free of ATP or 
the ratio of ADP to ATP is much greater than sixteen (e.g. [ADP]:[ATP] is 100:1 or greater). 
[0105] It is understood that gamma-hydrolysis-resistant ATP analogs, certain other ATP 
analogs, and other ADP analogs may be boimd to an MSH dimer, and that these analog-bound 
10 dimers will associated with mismatched duplex DNA. By way of example, MSH2:MSH6 

O dimers will associate with mismatched DNA in the presence of either ATP-PNP or ATP-PCP. 

m [0106] The MSH dimer that is useful in the compositions, kits, and methods of the 

invention may be used in a variety of states of purity or isolation. For example, the dimer may 
be present in a liquid which a variety of other proteins, nucleic acids, lipids, single stranded 
15 nucleic acids, non-mismatched duplex DNA, and the like, it being understood that if the dimer 

iin is used in the form of a mismatched duplex DNA-containing liquid then it may be necessary to 

dissociate, and possibly to separate, the dimer from the mismatched DNA prior to using it in the 

W compositions, kits, and methods of the invention. Preferably, the MSH dimer is substantially 

purified. 

20 [0107] In many of the compositions, kits, and methods of the invention, the MSH dimer or 
the mismatched duplex DNA may bound to a support. Furthermore, each of the MSH dimer 
and the mismatched duplex DNA may be bound to different supports. 

[0108] The MSH dimer or a MutS homolog of the dimer may be bound to a support using 
any known method for attaching a protein to a surface. For example the MutS homolog may be 

25 bound to a support by way of an antibody which is covalently bound to the support and which 
has a variable region which specifically binds to the MutS homolog. By way of example, an 
antibody which specifically binds to hMSH2 such as the antibody described by Kinzler et al. 
(PCT publication number W096/41 192) may be used to bind an hMSH2 protein dimer or a 
complex comprising an hMSH2 protein molecule and either an hMSH3 protein molecule or an 

30 hMSH6 protein molecule to a support to which the antibody is fixed. Methods of fixing an 
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antibody to a support have been described in the art (e.g. Harlow et al, 1988, Antibodies: A 
Laboratory Manual, Cold Spring Harbor, New York). Alternately, covalent, ionic, 
hydrophobic, or other types of bonding forces may be used to attach an MSH dimer or a MutS 
homolog to a support. 

5 [0109] The duplex DNA molecule may be bound to a support using any known method for 
attaching a nucleic acid to a support. By way of example, the nucleic acid may be covalently 
linked to a biotin molecule and the support may be linked to or coated with a streptavidin 
molecule, whereby the streptavidin molecule is capable of binding the biotin molecule, thereby 
linking the nucleic acid to the support. Further by way of example, the duplex DNA may be 
10 covalently attached to a chemical substituent present on a surface of the support. Alternately, 
^5 covalent, ionic, hydrophobic, or other types of bonding forces may be used to attach the duplex 

hQ DNA to the support. 

P [0110] Supports to which an MSH dimer, a MutS homolog, or a duplex DNA molecule 

may be bound include any support known in the art for use in in vitro or in vitro biochemical or 
ifl 15 medical applications. By way of example, and not limitation, such supports include latex and 
13 other polymeric beads, particles, plates, supports, chromatography media, implants, drug 

jlp delivery vehicles, metal and glass surfaces, gelatinous surfaces such as agarose, alginates, and 

W polyacrylamides, and the like. It is important that the ability of the MutS homolog monomers 

or MSH dimers which are bound to the support be attached in such a way that the ability of the 
20 monomers to dimers to attain altered conformations is not significantly hindered. It is 

understood that, for example, by isolating antibodies which specifically bind to various epitopes 
on the monomer or dimer surface, a variety of antibodies may be isolated an used to bind 
monomers or dimers to a support. By assaying the ability of the support-bound monomers or 
dimers to bind to mismatched DNA in the presence of ADP, for example as described herein, 
25 an antibody or other support which attaches the monomers or dimers to a support without 

hindering their ability to bind mismatched duplex DNA may be identified. Such methods are 
routine in the art of protein immobilization and are not further described herein. 
[0111] As disclosed herein, after an ADP-bound MSH dimer binds to a mismatched region 
of a duplex DNA, exchange of ATP for the ADP bound to the dimer results in release of the 
30 dimer from the mismatched region, whereby the ATP-bound dimer is enabled to diffuse to a 
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different position on the DNA. If the dimer is able to diffuse to a free end of the duplex DNA, 
the dimer may dissociate from the duplex DNA. 

[0112] If the duplex DNA does not have a free end, then ATP-bound dimer may diffuse 
away from the mismatched region of the duplex DNA, but may not dissociate from the DNA. 
5 Thus, if the duplex DNA does not have a free end, a plurality of copies of the dimer may be 
associated v^^ith the DNA in the presence of ATP. No upper limit is known for the number of 
dimers which may be associated with the DNA, but it is contemplated that this number is 
roughly proportional to the length of the duplex DNA. It is understood that association of 
multiple copies of an MSH dimer with a mismatched duplex DNA may be advantageous in 
10 situations in which association of the dimer with the DNA is to be detected. Multiple copies of 
C3 the dimer may boost the detection limit of the DNA to be detected, increasing the signal-to- 

,p noise ratio of the detection method. 

[0113] Duplex DNA not having a free end may be circular DNA or it may be linear DNA 
;fi wherein both ends of the DNA are blocked. Ends of duplex DNA may be blocked by binding 

15 bulky moieties such as proteins to the DNA either directly (e.g. by covalently attaching the 
iin protein to the DNA or by binding the protein to the DNA non-covalently with high affinity) or 

via a linker (e.g. by biotinylating the DNA and binding an avidin such as streptavidin to the 
M biotin moiety). Bulky moieties which may be used to block the ends of duplex DNA include, 

[y.- but are not limited to, proteins, supports, hairpin DNA structures, stem-and-loop DNA 

20 structures, and multiple a stem-and-loop DNA structures. Association of MSH dimers with 
DNA having one, two, or no free ends is expressly contemplated. 

[0114] The mismatched duplex DNA to which an MSH dimer is to be bound may, for 
example, comprise a first DNA strand having a reference nucleotide sequence and a second 
DNA strand selected from the group consisting of a DNA strand obtained from an organism, a 

25 DNA strand obtained by amplification of at least a portion of a polynucleotide obtained from an 
organism, a DNA strand obtained by cleavage of a polynucleotide obtained from an organism, 
and a DNA strand obtained by reverse transcription of a polynucleotide obtained from an 
organism. By way of example, the second DNA strand may comprise at least a portion of a 
gene associated with a cancer in the organism. This gene may, for example, be any of a number 

30 of oncogenes and tumor suppressor genes which are known in the art. Examples of such genes 
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include, for example, abl, akt2, ape, bcl2alpha, bcl2beta, bcl3, bcr, brcal, brca2, cbl, ccndl, 
cdk4, crk-II, csflr/fms, dbl, dec, dpc4/smad4, e-cad, e2fl/rbap, egfr/erbb-l, elkl, elk3, eph, erg, 
etsl, ets2, fer, fgr/src2, flil/ergb2, fos, fps/fes, fral, fra2, fyn, hck, hek, her2/erbb-2/neu, 
her3/erbb-3, her4/erbb-4, hrasl, hst2, hstfl, ink4a, ink4b, int2/fgf3, jun, junb, jund, kip2, kit, 
kras2a, kras2b, Ick, lyn, mas, max, mcc, met, mlhl, mos, msh2, msh3, msh6, myb, myba, 
mybb, myc, mycU, my en, nfl, nf2, nras, p53, pdgfb, piml, pmsl, pms2, ptc, pten, rafl, rbl, rel, 
ret, rosl, ski, srcl, tall, tgfbr2, thral, thrb, tiaml, trk, vav, vhl, wafl, wntl, wnt2, wtl, and 
yesl. These genes are described in various publicly available databases, including the U.S. 
National Cancer Institute/National Center for Biotechnology Information Cancer Genome 
Anatomy Project database. Various accession numbers for these genes are listed in Table 1 . 
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Table 1 



Gene Symbol 


Entrez Accession 


PubMed UID 


UniGene CID 


CGAP 


ABL 


X16416 


90082420 


Hs.82576 


AA601510 


AKT2 


M95936 


93028445 


Hs.37433 


AA505663 


APC 


M74088 


91335210 


Hs.75081 


AA592971 


BCL2ALPHA 


M13994 


86259760 


Hs.89534 


AA577385 


BCL2BETA 


M13995 


86259760 


Hs.99916 




BCL3 


M31732 


90199880 


Hs.31210 


AA527996 


BCR 


Y00661 


85240564 


Hs.2557 


AA592930 


BRCAl 


U14680 


95025896 


Hs.66746 


AA484941 


BRCA2 


X95161 


96112016 


Hs.34012 


AA215820 


CBL 


X57110 


92228506 


Hs.99980 




CCNDl 


M64349 


91235304 


Hs.82932 


AA592929 


CDK4 


U37022 


8528263 


Hs.95577 


AA483705 


CRK-II 


D10656 


92334347 


Hs.l6 




CSFIR/FMS 


X03663 


86175013 


Hs.75116 


AA595091 


DBL 


X12556 


89052660 


Hs.89543 




DCC 


X76132 


95011532 


Hs.68149 




DPC4/SMAD4 


U44378 


96144684 


Hs.75862 


AA576881 


E-CAD 


Z13009 


93211394 


Hs.82004 


AA603448 


E2F1/RBAP 


M96577 


92346720 


Hs.89494 




EGFR/ERBB-1 


X00588 


84219729 


Hs.77432 


AA587386 


ELKl 


M25269 


89203250 


Hs.1399 


AA576028 


ELKS 


Z36715 


95047310 




AA262193 


EPH 


M18391 


88070650 


Hs.1113 




ERG 


Ml 7254 


87263429 


Hs.70388 




ETSl 


X14798 


89083219 






ETS2 


J04102 


89042086 


Hs.85146 


AA480196 


FER 


J03358 


89261786 




AA534773 
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FGR(SRC2) 


M12502 


85205090 


Hs.1422 




FLI1/ERGB2 


M98833 


93075640 


Hs.736 




FOS 


V01512 


83221560 


Hs.25647 


AA5 14238 


FPS/FES 


X06292 


86055727 


Hs.7636 




FRAl 


XI 6707 


90191709 


Hs.4245 




FRA2 


XI 6706 


90191709 


Hs.89765 


AA601534 


FYN 


M14333 


86287278 


Hs.75390 


AA524156 


HCK 


M16591 


87257942 


Hs.77058 




HEK 


M83941 


92179233 






HER2/ERBB-2 
/NEU 


X03363 


86118663 


Hs.46254 


AA508596 


HER3/ERBB-3 


M29366 


90083234 


Hs.82186 


AA570304 


HER4/ERBB-4 


L07868 


93189574 


Hs.1939 




HRASl 


V00574 


83141783 


Hs.37003 


AA483837 


HST2 


X63454 


92195660 






HSTFl 


J02986 


87204251 


Hs.1755 




INK4A 


L27211 


94081956 


Hs.1174 


AA557137 


INK4B 


L36844 


94359613 






INT2/FGF3 


X14445 


89239468 


Hs.37092 


AA525331 


JUN 


J04111 


89057892 


Hs.78465 


AA582267 


JUNB 


M29039 


90090625 


Hs.89792 


AA503220 


JUND 


X56681 


91232849 


Hs.2780 


AA533575 


KIP2 


D64137 


96209909 


Hs.9039 


AA524076 


KIT 


X06182 


88111521 


Hs.81665 


AA552932 


KRAS2A 


L00045 


83271513 






KRAS2B 


X01669 


85087906 






LCK 


X13529 


89123626 


Hs.1765 


AA282059 


LYN 


M16038 


87172710 


Hs.80887 


AA524487 


MAS 


M13150 


86218084 


Hs.99900 
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MAX 


M64240 


91173288 


Hs.89500 


AA592936 


MCC 


M62397 


91164855 


Hs.1345 




MET 


J02958 


87317655 


Hs.35379 




MLHl 


U07343 


8145827 


Hs.57301 




MOS 


J00119 


82275068 






MSH2 


U04045 


94084796 


Hs.78934 


AA502616 


MYB 


Ml 5024 


87092302 


Hs.1334 


AA535078 


MYBA 


X66087 




Hs.2537 


AA459003 


MYBB 


X13293 


89083548 


Hs.74605 


AA603093 


MYC 


X00196 


84131953 


Hs.79070 




MYCLl 


M19720 


88094386 


Hs.92137 




MYCN 


Y00664 


88202932 


Hs.25960 


AA548970 


NFl 


M89914 


90335969 


Hs.37170 


AA534609 


NF2 


L11353 


93201601 


Hs.902 


AA617825 


NRAS 


X02751 


85269641 


Hs.82602 


AA558915 


P53 


K03199 


85267676 


Hs.1846 


AA5 14357 


PDGFB 


M12783 


87217119 


Hs.1976 




PIMl 


M27903 


90382681 


Hs.81170 


AA251525 


PTC 


U59464 


8658145 


Hs.54503 




RAFl 


X03484 


86120351 


Hs.85181 


AA578685 


RBI 


Ml 5400 


87149066 


Hs.75770 


AA594282 


REL 


X75042 


89330980 


Hs.44313 


AA279536 


RET 


M16029 


87257826 


Hs.6253 




ROSl 


M34353 


90280463 


Hs.1041 




SKI 


X15218 


89345144 


Hs.2969 


AA258011 


SRCl 


M16243 


87257903 


Hs.65442 


AA523427 


TALI 


M29038 


90099309 


Hs.73828 


AA551582 


TGFBR2 


M85079 


92154690 


Hs.82028 


AA5 15322 


THRAl 


Y00479 


88067793 


Hs.724 


AA602782 



-37- 



THRB 


X04707 


87090375 




AA577807 


TIAMl 


X86351 


96129318 


Hs.3205 




TRK 


M23102 


89181575 


Hs.85844 




VAV 


X16316 


90005432 






VHL 


LI 5409 


93262488 


Hs.78160 




WAFl 


L25610 


94061996 


Hs.74984 


AA614342 


WNTl 


X03072 


86055728 






WNT2 


X07876 


89005063 


Hs.89791 


AA601910 


WTl 


X51630 


90158822 


Hs.1145 




YESl 


Ml 5990 


87172733 


Hs.75680 


AA502695 



[0115] In a preferred embodiment, the gene associated with a cancer is a gene associated 
with hereditary non-polyposis colon cancer. For example, the gene may be selected from the 
group consisting of mlhl, msh2, mshS, msh6, pmsl, and pms2. In another embodiment the 
gene may be a gene associated with a cancer selected from the group consisting of a leukemia, a 
lymphoma, a meningioma, a mixed tumor of a salivary gland, an adenoma, a carcinoma, an 
adenocarcinoma, a sarcoma, a dysgerminoma, a retinoblastoma, a Wilms' tumor, a 
neuroblastoma, a melanoma, and a mesothelioma. 

[0116] If an MSH dimer is contacted with a mixture of mismatched duplex DNA and non- 
mismatched duplex DNA, the dimer will preferentially associate with the mismatched duplex 
DNA. The mismatched duplex DNA is thereby labeled differently than the non-mismatched 
duplex DNA, and MSH dimer associated with mismatched duplex DNA may be detected as 
describe herein or separated from the non-mismatched duplex DNA, By separating the dimer 
from non-mismatched duplex DNA, the mismatched duplex DNA bound to the dimer is 
separated from the non-mismatched duplex DNA. Furthermore, mismatched duplex DNA may 
be dissociated from the dimer after separating it from the non-mismatched duplex DNA. 
[0117] Methods of detecting an MSH dimer associated with mismatched duplex DNA 
include, but are not limited to, electrophoretic gel mobility shift assays, HPLC and other 
column and thin layer chromatographic methods, filter binding assays, immunologic detection 
methods such as ELISA, tagged antibody, and precipitation assays, centrifixgal sedimentation 
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methods, optical affinity sensing, 'footprinf and other nucleolytic cleavage protection assays, 
and spectroscopic assays. 

[01 18] In a preferred method of detecting specific binding of the MutS homolog to the 
duplex DNA molecule, an optical affinity biosensor system (OABS) is used to detect specific 

5 binding. In an OABS system such as the lASYS™ system (Affinity Sensors, Cambridge, 

United Kingdom), binding and dissociation events can be detected as one molecule in solution 
binds to or dissociates fi*om another molecule immobilized on a detector surface of the system. 
Thus, an OABS may be used to detect specific binding between an MSH dimer and a 
mismatched duplex DNA in any of the methods of the invention by immobilizing either the 

10 MSH dimer or the mismatched duplex DNA on the detector surface of the OABS. Specific 
binding may be differentiated from non-specific binding by comparing binding of an MSH 
dimer to a duplex DNA molecule known to comprise a mismatched region and binding of the 
homolog to a duplex DNA molecule knovra not to comprise a mismatched region. 
[0119] By way of example, the separation of a mismatched duplex DNA from a population 

15 of duplex DNAs may be achieved by binding an MSH dimer to a support, contacting the 

support with the population of duplex DNAs, and rinsing the support with a separating solution 
which does not comprise the population of duplex DNAs. If the mismatched duplex DNA has a 
free end, then the separating solution is preferably substantially free of ATP. In this example, a 
mismatched duplex DNA in the population of duplex DNAs binds to the MSH dimer and 

20 thereby becomes associated with the support. The mismatched duplex DNA is segregated from 
the other duplex DNAs of the population by rinsing the support vrith the separating solution, 
which carries the non-mismatched DNA molecules away from the support. Thus, according to 
this example, the mismatched duplex DNA is physically separated from the non-mismatched 
duplex DNAs of the population. 

25 [0120] It is not necessary that the just-described method result in separation of the 

mismatched duplex DNA from the population such that the molecule and the population are 
contained in different containers at the conclusion of the method. By way of example, it is 
sufficient in the OABS described herein that a mismatched duplex DNA comprising a region 
associate with the detector surface of the OABS and that non-mismatched duplex DNAs do not 

30 associate with the detector surface of the OABS. Thus, for example, in OABS methods for 
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detection of mismatched duplex DNAs, an MSH dimer may be associated with the detector 
surface of the OABS, whereby a mismatched duplex DNA binds to the homolog in the presence 
of ADP and is detected, and whereby a non-mismatched duplex DNA does not bind appreciably 
to the dimer and is not detected. 
5 [0121] Mismatched duplex DNA may be dissociated from an MSH dimer after separating 
the MSH dimer from a population comprising the mismatched duplex DNA and non- 
mismatched duplex DNAs. The mechanism by which this dissociation may be achieved 
depends upon whether or not the duplex DNA has a free end. 

[0122] If the duplex DNA has a free end, then the an MSH dimer may be dissociated from 
10 the duplex DNA by contacting the dimer-mismatched duplex DNA complex with a solution 
ii5 having a high salt concentration, with a solution comprising EDTA or another magnesium- 

chelating agent, or with a releasing solution comprising ATP. Preferably, such a releasing 
■ Jf; solution comprises at least about 0,3 micromolar ATP, more preferably at least about 3 

=^0 micromolar, more preferably at least about 30 micromolar ATP, and even more preferably 

hQ 15 much more than 30 micromolar ATP (e.g. 200 micromolar ATP or 500 micromolar ATP). If 
;k the mismatched duplex DNA has a free end, then the MSH dimer may be dissociated therefrom 

J^l simply by contacting the dimer with a solution comprising ATP, The MSH dimer may also be 

ly dissociated from the mismatched duplex DNA by contacting the dimer-mismatched duplex 

DNA complex with a gamma-modified ATP analog. 
20 [0123] If the mismatched duplex DNA does not have a free end, then an MSH dimer may 
be dissociated from the duplex DNA by contacting the dimer-mismatched duplex DNA 
complex with a solution which comprises high salt or EDTA or another magnesium-chelating 
agent. The dimer will not dissociate from the duplex DNA having no free end in the presence 
of ATP and magnesium ions (e.g. at least about 10 nanomolar Mg , preferably at least about 1 

9-1- 9+ 

25 micromolar Mg , and more preferably at least about 1 00 micromolar Mg . However, if a 
free end is generated on the mismatched duplex DNA, for example, by cleaving a circular 
DNA, by removing a blocking group from a blocked end of the DNA, or by cleaving the 
blocked end of the DNA, then the dimer will dissociate from the duplex DNA in the presence of 
ATP and magnesium ions. It is understood that there may be some situations in which 

30 association of MSH dimers is advantageous (e.g. separating DNA associated with MSH from 
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DNA not associated with MSH). In such situations, taking advantage of the property of MSH 
dimers to exchange ADP to ATP only when a mismatch is present will permit association of 
multiple copies of the MSH dimer with the DNA, effectively increasing the amount of MSH 
dimer which can be detected using one or more of the methods described herein. This increase 
5 may be particularly important where the detection limit of the assay is relatively low. 

[0124] Mismatched duplex DNA may be separated from a population of duplex DNAs by 
contacting the population and an MSH dimer and binding the MSH dimer to a support after 
contacting it with the population, but prior to separating the non-mismatched duplex DNA from 
the MSH dimer. 

10 [0125] It is understood that if acceleration of ATP displacement of ADP bound to an MSH 
O dimer or acceleration of ATP hydrolysis by MSH dimer not bound to duplex DNA is desired, 

^ the MSH dimer may be contacted with a MutL homolog to achieve this acceleration. It is 

't! furthermore understood that if the MSH dimer is present in molar excess with respect to the 

■C mismatched duplex DNA an average of more than one copy of the MSH dimer may be 

15 associated with individual copies of the mismatched duplex DNA if ATP is available to the 
r=i MSH dimer. The average number of copies of the MSH dimer associated with individual 

^ copies of the mismatched duplex DNA may be further increased by contacting the MSH dimer 

fli with a MutL homolog. Similarly, the average number of copies of the MSH dimer associated 

with individual copies of the mismatched duplex DNA may be increased by employmg 
20 solutions which favor formation of ADP-bound MSH dimer and displacement of ADP bound to 
mismatch-bound dimer by ATP, Such conditions include, but are not limited to, increasing the 
concentration ADP in the binding solution, increasing the concentration ATP, magnesium, or 
both, in the binding solution, and increasing the concentration of the dimer in the binding 
solution. 

25 [0126] The properties of MSH dimers described above can be employed in a variety of 
useful methods including, but not limited to the following. It is understood that other methods 
which usefully employ the methods described above may be devised by the ordinarily skilled 
worker in view of the teachings provided herein. 

[0127] The invention includes a method of segregating a mismatched duplex DNA from a 
30 population of DNA molecules. This method comprises contacting an MSH dimer and the 
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population in the presence of a binding solution comprising a nucleotide selected from the 
group consisting of ADP and ATP. In the presence of this binding solution, the MSH dimer 
associates with the duplex DNA. After contacting the dimer and the population, the MSH 
dimer is segregated from the population. The duplex DNA is thereby segregated from the 
5 population. 

[0128] The invention also includes a method of detecting a difference between a sample 
nucleotide sequence and a reference nucleotide sequence. This method comprises annealing a 
first DNA strand and a second DNA strand to form a duplex DNA. The first DNA strand has 
the sample nucleotide sequence, and the second DNA strand has a nucleotide sequence which is 

10 complementary to the reference nucleotide sequence. If there is a difference between the 

sample nucleotide sequence and the reference nucleotide sequence, then the duplex DNA will 
be a mismatched duplex DNA. After annealing the DNA strands, the duplex DNA and an MSH 
dimer are contacted in the presence of a binding solution as described herein. If the duplex 
DNA is a mismatched duplex DNA, then the MSH dimer associates with the duplex DNA. 

15 After contacting the duplex DNA and the MSH dimer, association of the MSH dimer with the 
duplex DNA molecule is detected as described herein. Association of the MSH dimer with the 
duplex DNA molecule is an indication that there is a difference between the sample nucleotide 
sequence and the reference nucleotide sequence. 

[0129] The invention fiirther includes a method of determining whether a mammal is 
20 predisposed for carcinogenesis. This method comprises annealing a first DNA strand and a 
second DNA strand to form a duplex DNA. The first DNA strand has the nucleotide sequence 
of at least a region of an oncogene or a tumor suppressor gene of the mammal, such as one of 
those described herein. The second DNA strand has a nucleotide sequence which is 
complementary to the consensus nucleotide sequence of this region. If there is a sequence 
25 difference between the first DNA strand and the second DNA strand, then the duplex DNA will 
be a mismatched duplex DNA. The duplex DNA is contacted with an MSH dimer in the 
presence of a binding solution as described herein. The MSH dimer associates with the duplex 
DNA if the duplex DNA is a mismatched duplex DNA. After contacting the duplex DNA and 
the MSH dimer, association of the MSH dimer with the duplex DNA molecule is detected as 
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described herein. Association of the MSH dimer with the duplex DNA molecule is an 
indication that the mammal is predisposed for carcinogenesis. 

[0130] The invention still further includes a method of fractionating a population of duplex 
DNAs. This method comprises contacting the population with an MSH dimer in the presence 
5 of a binding solution as described herein. The MSH dimer associates with any mismatched 
duplex DNA in the population. The MSH dimer is segregated from the population, and any 
mismatched duplex DNA from the population is segregated from the population. The 
population is thereby fractionated. 

[0131] The invention also includes a method of selectively amplifying at least one 
10 mismatched duplex DNA of a population of duplex DNAs. This method comprises contacting 
C3 the population with an MSH dimer in the presence of a binding solution as described herein. 

=3 The MSH dimer associates with the mismatched duplex DNA. The MSH dimer is segregated 

from the population, and the mismatched duplex DNA is thereby segregated from the 
]^ population. The mismatched duplex DNA is then amplified. 

15 [0132] The invention fiirther includes a method of determining whether the nucleotide 
Cj sequence of a first copy of a genomic sequence differs from the nucleotide sequence of a second 

m copy of the genomic sequence. This method comprises amplifying a region of each of the first 

?ii <^opy and the second copy of the genomic sequence to yield amplified first copies and amplified 

second copies. The amplified first copies and the amplified second copies are mixed and 
20 denatured to form a first mixture. The nucleic acids in the first mixture are annealed to form a 
second mixture comprising duplex DNAs. If the nucleotide sequence of first copy and the 
nucleotide sequence of the second copy of the genomic sequence differ, then at least some of 
the duplex DNAs in the second mixture are mismatched duplex DNAs. The second mixture is 
contacted with an MSH dimer in the presence of a binding solution as described herein. The 
25 MSH dimer associates with any mismatched duplex DNAs that are present in the second 

mixture. Association of the MSH dimer with duplex DNA is then detected. Association of the 
MSH dimer duplex DNA is an indication that the nucleotide sequence of the first copy of the 
genomic sequence differs from the nucleotide sequence of the second copy of the genomic 
sequence. The first and second copies of the genomic sequence may be obtained from a single 
30 eukaryotic organism or from different eukaryotic individuals of the same or a different species. 
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If the first and second copies of the genomic sequence are obtained from a single individual, 
one copy may be obtained from each of a pair of the individual's chromosomes. If the first and 
second copies of the genomic sequence are obtained from different individuals of the same 
species, then the individuals may, for example, be related, unrelated, or congenic. 
5 [0133] The invention yet further includes a composition for segregating a mismatched 
duplex DNA from a population of duplex DNAs, the composition comprises an MSH dimer 
bound to a support, and may be used in any of the methods described herein. The composition 
may be a component of a kit v^hich includes an instructional material which describes a method 
of the invention v^herein the composition is useful. The kit may instead comprise the 
10 composition and a binding solution or a releasing solution, as described herein. 

[0134] The invention also includes a kit for screening a genomic region for. a nucleotide 
sequence v^hich differs from a reference nucleotide sequence. This kit comprises a pair of 
primers complementary to the ends of the region. The pair of primers is useful for amplifying 
the region. The kit further includes a DNA strand having the reference nucleotide sequence and 
15 at least one MutS homolog. The MutS homolog may be supplied in the form of an MSH dimer. 
The kit may be used to perform the methods described herein. The kit may further comprise 
additional components, such as an instructional material which describes use of the kit to 
W perform a method described herein, an assay reagent for detecting binding of a mismatched 

duplex DNA to the MSH dimer, or a reagent for blocking the ends of duplex DNAs. By way of 
20 example, the primers of the kit may be biotiny lated and the kit may further comprise an avidin 
such as streptavidin for blocking the ends of duplex DNA. 

[0135] The invention further includes a kit for separating a mismatched duplex DNA from 
non-mismatched duplex DNAs. This kit comprising at least one MutS homolog, a linker for 
binding the MutS homolog to a support, and an additional reagent selected from the group 
25 consisting of a nucleotide and a releasing solution, as described herein. The releasing solution 
may, for example, comprise a compound selected from the group consisting of ATP and a 
gamma-modified ATP analog. The kit may further comprise a reagent for blocking the ends of 
a duplex DNA, such as biotinylated PGR primers which can be used to amplify the duplex 
DNA, prior to contacting the biotinylated duplex DNA with an avidin such as streptavidin. 
30 Alternately, the kit may comprise a binding solution which is substantially free of ATP, 
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magnesium ions, or both, whereby when a support-bound MSH dimer binds a mismatched 
duplex DNA, the dimer is not able to bind ATP and magnesium ion, and thus cannot exhibit 
DNA-associated diffusion and the duplex DNA remains bound to the ADP-bound dimer. 
[0136] The invention further includes a nonhuman mammal which is nuUizygous for both 
5 Msh2 and p53. The nonhuman mammal does not express Msh2 or p53 and exhibits a 
phenotype selected from the group consisting of inappropriate fetal apoptosis and a 
predisposition for carcinogenesis. Preferably, the mammal is a mouse, but other non-human 
mammals may also be generated using the teaching provided herein. 

[0137] The invention still further includes a method of making a nonhuman mammal which 
10 is nuUizygous for both Msh2 and p53. Such a mammal does not express Msh2 or p53 and 
{5 exhibits a phenotype selected from the group consisting of a predisposition for inappropriate 

, 1 fetal apoptosis and a predisposition for carcinogenesis. Such mammals are made by mating a 

first parent mammal comprising at least one null allele of Msh2 and at least one null allele of 
'^D p53 and a second parent mammal comprising at least one null allele of Msh2 and at least one 

hQ 15 null allele of p53. The offspring of the two parent mammals inherit the null alleles of these two 
J^^j genes according to normal allelic segregation rales (i.e. generally speaking, most mammals will 

ji^ randomly inherit one of each parent's two alleles of a gene). Thus, the proportion of nonhuman 

111 mammals which are nuUizygous for both Msh2 and p53 will depend upon the allelic 

composition of the parents. Offspring which are nuUizygous for both Msh2 and p53 do not 
20 express Msh2 or p53 and exhibit a phenotype selected from the group consisting of 

inappropriate fetal apoptosis and a predisposition for carcinogenesis. Further details relating to 

this method are described herein, such as in Example 2. 

[0138] The invention also includes several screening methods, all of which make use of the 
properties of the Msh2''^"p53"^" mice described herein. 
25 [0139] A standard screening procedure is now described which is useful for determining the 
tumorigenesis-, apoptosis-, aging-, or fetal development-modulating potential of a compound. 
While this procedures is described with respect to particular protocols and mice, it will be 
appreciated that the screening procedure described should not be construed to Umit the 
invention in any way. 
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[0140] Msh2"^'p53'^" mice are generated as described herein or obtained jfrom a producer of 
such mice. A predetermined amount of the compound is administered to a Msh2'^"p53'^" 
mouse by any practical means. The method of administration of the compound is not critical. 
By way of example, the compound may be administered orally, intraperitoneally, intravenously, 
topically, intramuscularly, or via a pulmonary route. 

[0141] Following administration of the compound, the Mshl'^'pSS""^" mouse, each Msh2"^" 
p53"^' mouse is observed for about four months. Each mouse is examined approximately daily. 
Every week, each mouse is weighed, observed for any clinically-relevant symptoms, and the 
number and extent of tumors are assessed. 

[0142] To reduce any potential for bias, the study is blinded. A first investigator treats all 
mice with compound(s) and identifiably marks or cages the transgenic mice, so that the nature 
of the treatments will not be known to a second investigator, who performs all tumor counts, 
weighing, and general observations. 

[0143] If the mice are being used to screen for tumorigenesis-modulating compounds, then 
after observations are completed, the rate of tumor incidence and the tumor yield are 
determined for each group of Msh2"^'p53"^' mice to which the compound was appUed. A 
higher or lower rate of tumor incidence or a higher or lower tumor yield for a group of Msh2"'^' 
p53''^" mice to which the compound was applied, compared with the levels of tumor incidence 
and tumor yield for a group of Msh2"^°p53"^' mice to which the compound was not applied, is 
an indication that the compound affects tumorigenesis. 

[0144] If the mice are being used to screen for apoptosis-modulating compounds or fetal 
development-modulating compounds, then the mice are preferably administered the compound 
and observed during fetal development. After observations are completed, the prevalence of 
inappropriate fetal apoptosis and the fetal survival rate are determined for each group of Msh2" 
^"p53"^" mouse embryos to which the compound was applied. A higher or lower mouse 
embryos or a higher or lower fetal survival rate for a group of Msh2"^"p53"^"mouse embryos to 
which the compound was applied, compared with the mouse embryos and fetal survival rate for 
a group of Msh2"^"p53"^"mouse embryos to which the compound was not applied, is an 
indication that the compound affects apoptosis or fetal development. 
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[0145] If the mice are being used to screen for aging-modulating compounds, then after 
observations are completed, the prevalence of at least one symptom of aging (e,g. graying of 
hair, other changes in coat color, lethargy, or hair loss) are determined for each group of Msh2" 
^"p53"^" mice to which the compound was applied. A higher or lower prevalence of a symptom 
5 of aging for a group of Msh2"^"p53"^" mice to which the compound was applied, compared with 
the prevalence of the symptom for a group of Msh2"^"p53'^' mice to which the compound was 
not applied, is an indication that the compound affects aging. 

[0146] Preferably, groups of Msh2"^"p53"^" mice or embryos are used, with each mouse in a 
group being treated identically. Also preferred are studies in which one of at least three 
10 different dose levels of the compound are applied to the mice or embryos in each of at least 

three corresponding groups of transgenic mice. It is preferred, where possible, to demonstrate a 
statistically significant difference (P < 0.05) between the observed phenotype for the first dose 
level and the observed phenotype for the third dose level. 

[0147] A cell line may be made using cells obtained from a Msh2"^"p53"^" mouse of the 
15 invention. Methods of making a cell line from a cell of a nonhuman animal are well known in 
the art. 

[0148] The invention also includes a method of determining whether a composition 
interferes with the activity of one of the p53 gene or one of its expression products and a MutS 
homo log gene or one of its expression products. According to this method, non-human 

20 mammals such as mice are generated which are nuUizygous for one of the p53 gene and the 

gene encoding the MutS homolog. These nuUizygous animals are crossed to generate embryos 
which are also nuUizygous for the same gene. The embryos are contacted with the composition, 
either in vitro or in utero, and the effects of contacting the embryos with the composition are 
observed. Increased mortality among the embryos, particularly among the female embryos, is 

25 an indication that the composition is able to interfere with the activity of the other of the p53 
gene or one of its expression products and a MutS homolog gene or one of its expression 
products. Thus, the ability of a composition to increase female embryonic lethality in mouse 
embryos which are nuUizygous for the p53 gene is an indication that the composition interferes 
with the activity of a MutS homolog gene or one of its expression products. Similarly, the 

30 ability of a composition to increase female embryonic lethality in mouse embryos which are 
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nullizygous for a MutS homolog gene is an indication that the composition interferes with the 
activity of the p53 gene or one of its expression products. Preferably, female embryos are 
selected and used. Also preferably, female embryonic lethality is observed at about 9.5 days 
gestation. Methods of generating both nulUzygous p53 animals such as mice and nullizygous 
5 msh gene animals such as mice have been described in the art. 

[0149] The invention further includes a composition comprismg a human MutS homolog 
fragment, wherein the fragment comprises a MutS homolog interaction region. The fragment 
may be a polypeptide having as many as all but one amino acid residues of the correspondmg 
MutS homolog. The interaction region may be any of the MutS homolog interaction regions 
10 described herein or a MutS homolog interaction region having significant homology thereto. 
O By way of example, a MutS homolog interaction region having significant homology to a MutS 

h§ homolog interaction region described herein may exhibit at least abovit 50%, and preferably at 

J least about 70%, 85%, 95%, or 99% homology with a MutS homolog interaction region 

j£ described herein. Thus, by way of example, the interaction region may be completely or 

^^0 15 significantly homologous to amino acid residues 378-625 of hMSH2, amino acid residues 875- 
G 934 of hMSH2, amino acid residues 126-250 of hMSH3, amino acid residues 1050-1 128 of 

* hMSH3, amino acid residues 326-575 of hMSH6, or amino acid residues 1302-1360 of 

W hMSH6. 

[0150] The composition comprising a human MutS homolog fragment of the invention is 
20 useful in a method of inhibiting association of a first human MutS homolog and a second 

human MutS homolog. This method comprises contacting at least one of the first human MutS 
homolog and the second human MutS homolog with the human MutS homolog fragment of the 
invention. Without wishing to be bound by any particular theory of operation, it is believed 
that the fragment will interact with at least one interaction region of one human MutS homolog, 
25 thereby preventing that homolog from associating with the other MutS homolog. Such 

compounds would have utility for inducing apoptosis in animal cells (e.g. human tumor cells) 
which harbor one or more mutations in their p53 genes. Such compounds would also be useful 
for sensitizing animal cells which harbor one or more mutations in their p53 genes for further 
treatment using, for example, DNA-damaging agents. 



-48- 



[0151] As described herein in Example 5, cDNA encoding hMSH5 has been discovered, 
and a protein encoded by that cDNA has also been discovered. hMSH5 may be purified in a 
manner directly analogous to the methods described herein (e.g. by his-tagging) or by other 
methods v^ell known in the art. The invention thus includes substantially purified hMSH5 and 
an isolated nucleic acid encoding hMSH5. 

[0152] The invention is now described with reference to the following Examples. These 
Examples are provided for the purpose of illustration only and the invention should in no way 
be construed as being limited to these examples, but rather should be construed to encompass 
any and all variations which become evident as a result of the teaching provided herein. 

Example 1 

The Human Mismatch Recognition Complex 
hMSH2:hMSH6 Functions as a Molecular Switch 

[0153] Adenine nucleotide binding by the human hMSH2:hMSH6 mismatch recognition 
complex functions as a novel molecular switch. The hMSH2:hMSH6 heterodimer is "ON" (i.e. 
it associates with mismatched DNA) in the ADP-bound form, and "OFF" (i.e. it is not capable 
of associating with mismatched DNA v^th which it is not already associated) in the ATP-bound 
form. The data presented herein establish that the switch is 'turned OFF' by displacement of 
complex-bound ADP by ATP. ATP-bound complex is recycled to the ADP-bound form, which 
is capable of binding to mismatched DNA, by intrinsic ATPase activity of the complex. 
[0154] The materials and methods used in the experiments presented in this Example are 
now described. 

Overexpression and purification of hMSH2:hMSH6 

[0155] Clones encoding hMSH2 and those encoding hMSH6 have been described (Acharya 
et al., 1996, Proc. Natl. Acad. Sci. USA 93:13629-13634; Fishel et al., 1993, Cell 75:1027- 
1038). In the experiments described herein, the clone encoding hMSH6 was modified to further 
encode six histidine residues at the amino terminus of the hMSH6 protein molecule. hMSH3 
can be similarly modified and isolated. 



-49- 



[01 56] hMSH2 and hMSH6 were overexpressed in SF9 insect cells using the pFastBac™ 
dual expression vector (Gibco BRL, Grand Island, NY) as described in the BAC-TO-BACtm 
baculovirus expression systems protocol (Gibco BRL, Grand Island, NY). Briefly, SF9 cells 
suspended in approximately 400 milliliters culture medium were infected using the vector, and 

5 were then cultured for 48 hours to achieve a cell density of approximately 1 0^ SF9 cells per 
milliliter. The cells contained in 200 milliliter aliquots of SF9 cells were harvested by 
centrifogation at 200 x g, resuspended in 10 milliliters of buffer A, and frozen at -80°C. Buffer 
A comprised 300 millimolar NaCl, 20 millimolar imidazole, 25 millunolar HEPES buffer 
adjusted to pH 7.8 using NaOH, 10% (v/v) glycerol, 0.5 millimolar 

10 phenylmethylsulfonylfluoride (PMSF), 0.8 micrograms per milliliter pepstatin, and 0.8 

0 micrograms per milliliter leupeptin. 

[0157] Cell extracts were prepared by thawing the cells, passing the cells through a 25 
' ji gauge needle, and then ultracentrifiiging the extract at 40,000 rotations per minute in a 

J'S Beckman Ti60 rotor for 70 minutes, according to known methods. About 1 00 milliliters of 

1 15 infected cells yielded approximately 2 milligrams of hMSH2:hMSH6 protein complex. All of 

the following protein purification procedures in this Example were carried out at 4°C. 

iS [0158] The supernatant was applied to a 2 milliliter nickel-NTASUPERFLOWT^^cohmm 

(Qiagen, Chatsworth, CA) at a flow rate of 0. 1 5 milliliters per minute using a Pharmacia FPLC 

C system. The column was washed by passing 3 5 milliliters of buffer A through the column. 

20 After washing the column, the hMSH2:hMSH6 heterodimer was eluted by applying 30 

milliliters of buffer A comprishig a Imear gradient of imidazole to the column and collecting 
the eluent from the column in fractions, wherein the concentration of imidizole was varied from 
20 millimolar to 200 millimolar. The hMSH2:hMSH6 heterodimer eluted in fractions 
containing approximately 70 millimolar imidizole. 
25 [0159] Fractions from the nickel-NTA column which contained peak amounts of the 

heterodimer were loaded at a flow rate of 0.2 miUiliters per minute directly onto a 1 milliliter 
PBE 94 column (a polybuffer exchange column obtained from Pharmacia, Upsala Sweden) 
which had been equilibrated with buffer B. Buffer B comprised 300 milUmolar NaCl, 25 
millimolar HEPES buffer adjusted to pH 7.8 using NaOH, 1 millimolar dithiothreitol (DTT), 
30 0.1 millimolar ethylenediaminetefraacetic acid (EDTA), 10% (v/v) glycerol, 0.5 millimolar 
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PMSF, 0.8 micrograms per milliliter pepstatin, and 0.8 micrograms per milliliter leupeptin. 
The PBE 94 colmnn was washed by passing 10 milliliters of buffer B through the column. 
After washing the colunm, the hMSH2/hMSH6 complex was eluted by applying 20 milliliters 
of buffer B comprising a linear gradient of NaCl to the column and collecting the eluent from 
the column in fractions, wherem the concentration of NaCl was varied from 300 millimolar to 1 
molar. The hMSH2:hMSH6 heterodimer eluted from the PBE 94 colunm in fractions 
containing approximately 575 millimolar NaCl. 

[0160] Fractions collected from the PBE 94 column which contained peak amounts of the 
heterodimer were dialyzed twice for two hours against 2 liters of a solution comprising 100 
millimolar NaCl, 25 millimolar HEPES buffer adjusted to pH 7.8 using NaOH, 1 millimolar 
DTT, 0.1 millimolar EDTA, and 20% (v/v) glycerol. Aliquots of the dialyzed solution 
containing the heterodimer were frozen using liquid nitrogen and stored at -80°C for several 
months without detectable loss of activity. 

[0161] hMSH2, hMSH6, and bovine serum albumin (BSA) contain nearly identical 
percentages (12%, 14%, and 13%, respectively) of arginine and heterocyclic amino acids, the 
amino acids known to interact with the Coomassie Brilliant Blue stain. Protein concentration in 
an aliquot comprising the hMSH2:hMSH6 heterodimer was determined by subjecting a portion 
of the aliquot to SDS-PAGE using a 6% (w/v) acrylamide gel, subjecting a known amount of 
BSA (Boehringer Mannheim, Indianapolis, IN) to SDS-PAGE using a 6% (w/v) acrylamide 
gel, staining the SDS-PAGE gels with Coomassie Brilliant Blue, and comparing the intensities 
of the protein bands m the gels to a BSA standard on a Coomassie stained 6% SDS PAGE to 
calculate protein concentration. The intensities of stained protem bands were measured using 
BioRad Gel Doc and MOLECULAR ANALYSTtm software. This protein quantitation method 
revealed the hMSH2 and hMSH6 proteins to be in near exact equimolar proportion in the 
heterodimer. 
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Preparation of 39- and 81 -base pair oligonucleotide probes 

[0162] The sequence of the 39-base pair oligonucleotide used in the experiments presented 
in this Example was: 5'-CGGCGAATTC CACCAAGCTT GATCGCTCGA GGTACCAGG-3' 
5 (SEQ ID NO: 1). The homologous 39-base pair DNA substrate used in the experiments 
presented in this Example was made by annealing the 39-base pair oUgonucleotide with an 
oligonucleotide (SEQ ID NO: 2) which was completely complementary thereto. The G/T 
mismatched 39-base pair DNA substrate used in the experiments presented in this Example was 
made by annealing the 39-base pair oligonucleotide with an oligonucleotide (SEQ ID NO: 3) 
10 which was completely complementary thereto, except that the oligonucleotide contained a G 
a residue at the nucleotide position complementary to the T residue at position 20 (numbered in 

is the direction extending from the 5' end to the 3* end) of the 39-base pair oligonucleotide. SEQ 

'J ID NO: 2 and SEQ ID NO: 3 are listed in Figure 8. 

■fl [0163] The nucleotide sequence of the 8 1 -base pair oligonucleotide used m the experiments 

3 15 described in this Example was: 5'-AAAGCTGGAG CTGAAGCTTA GCTTAGGATC 

U ATCGAGGATC GAGCTCGGTG CAATTCAGCG GTACCCAATT CGCCCTATAG T-3' 

;;p (SEQ ID NO: 4). The homologous 81 -base pair DNA substrate used in the experiments 

W presented in this Example was made by annealing the 8 1 -base pair oUgonucleotide with an 

U oligonucleotide (SEQ ID NO: 5) which was completely complementary thereto. The G/T 

20 mismatched 8 1 -base pair DNA substrate used in the experiments presented in this Example was 
made by annealing the 81 -base pair oligonucleotide with an oligonucleotide (having the 
nucleotide sequence listed in SEQ ID NO: 6) which was completely complementary thereto, 
except that the oligonucleotide contained a T residue at the nucleotide position complementary 
to the G residue at position 41 (numbered in the direction extending from the 5' end to the 3' 
25 end) of the 81-base pair oligonucleotide. SEQ ID NO: 5 and SEQ ID NO: 6 are hsted in Figure 
8. 

[0164] -^^P-end-labeled DNA substrates were prepared by incubating single stranded 
oligonucleotides in the presence of T4 polynucleotide kinase (Promega Corp., Madison, WI) 
and [^^P]gamma-ATP (NEN Dupont, Wilmington, DE). Excess label was separated from the 
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labeled DNA substrates using a CENTRISEP™ column (Princeton Separations, Princeton, NJ) 
per the manufacturer's instructions. 

[01 65] Labeled DNA substrate was annealed with a single-stranded DNA molecule which 
was either completely complementary thereto or contained a single G/T mismatch. To anneal 
5 the labeled DNA substrate with the single-stranded DNA molecule, the labeled molecule was 
suspended in a solution comprising a 10-fold excess of the single-stranded DNA molecule, 10 
millimolar Tris buffer which had been adjusted to pH 7.5 using HCl, 100 millimolar NaCl, and 
1 millimolar EDTA. The suspension was heated to 95°C and then slowly cooled to 55°C and 
was maintained at this temperature for twelve hours. Single-stranded DNA was removed from 
10 the suspension by incubating the suspension with benzoylated naphthoylated DEAE cellulose 
O (END cellulose, Sigma Chemical Co., St. Louis, MO) for twenty minutes in the presence of a 

^ solution comprising 1 .5 molar NaCl, 20 millimolar Tris buffer which had been adjusted to pH 

7.5 using HCl, and 0.5 millimolar EDTA. BND cellulose was then pelleted by centrifuging the 
suspension for about five minutes using an Eppendorf bench-top centrifuge. Double-stranded 
^5 15 DNA, v/hich remained in the supernatant^ was separated from the BND cellulose by filtration 

and was then precipitated by adding ethanol to the supernatant. The double-stranded labeled 
ii^i DNA substrate was resuspended in a solution comprising 10 millimolar Tris buffer which had 

W been adjusted to pH 7.5 using HCl, 100 millimolar NaCl, and 1 millimolar EDTA, Single- 

Stranded DNA could not be detected in the solution, as assessed by 4% (w/v) native PAGE 
20 separation of the nucleotides in the solution. Non--^^P-labeled oligonucleotides were prepared 
using analogous methods. 

Gel mobility shift assays 

[0166] Gel mobility shift assays were performed by incubating a hMSH2:hMSH6 
25 heterodimer and 9 femtomoles of either the -^^P-labeled homologous 81 -base pair DNA 
substrate or the ^^P-labeled G/T-mismatched 81 -base pair DNA substrate in a buffer 
comprising 50 millimolar NaCl, 25 millimolar HEPES buffer which had been adjusted to pH 
7,5 using NaOH, 1 millimolar DTT, 0.01 millimolar EDTA, and 15% (v/v) glycerol. The 
buffer included 10 nanograms per microliter of poly dl-dC (Pharmacia LKB Biotechnology 
30 Inc., Piscataway, NJ). Poly dl-dC is an alternating nucleic acid polymer which does not 
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interfere with binding of the hMSH2:hMSH6 heterodimer to DNA. In certain experiments 
described herein, the incubation mixture further comprised selected concentrations of 
nucleotides or non-labeled DNA. In other experiments described herein, the incubation mixture 
further comprised 1 millimolar MgCl2 or 5 millimolar EDTA. Except as otherwise described 

5 herein, each incubation mixture had a volume of 20 microliters and was incubated for fifteen 
minutes at 37°C and then immediately placed on ice. Each incubation mixture was applied to a 
gel comprising 4% (w/v) polyacrylamide (29:1 ratio of acrylamide:bis-acrylamide) 4% (v/v) 
glycerol, 40 millimolar Tris acetate buffer (pH 7.8), and 1 millimolar EDTA. Electrophoresis 
was performed by applying 200 volts to the gel for two hours. Following electrophoresis, each 

10 gel was dried and quantitated using a phosphoimaging device obtained from Molecular 
Dynamics. 

Footprint assays 

[0167] Incubation of the hMSH2:hMSH6 heterodimer with ^^P-labeled DNA substrates 

32 

15 was performed as described for gel mobility shift assays, except that 1 8 femtomoles of P- 

labeled DNA substrate was used in each assay. Following incubation, 80 microliters of a buffer 
comprising 50 millimolar NaCl, 25 millimolar HEPES buffer which had been adjusted to pH 
7.8 using NaOH, 1 millimolar DTT, 10 nanograms per microliter poly dl-dC, 1.25 millimolar 
CaCl2, 3.1 millimolar MgCl2, 10% (v/v) glycerol, and 33 picograms per microliter DNase 

20 (Boehringer Mannheim, Indianapolis, IN) was added to each incubation mixture. The mixtures 
were incubated at 37°C for an additional three minutes, and then 0.7 milliliters of a solution 
having a pH of 5.2 and comprising 95% (v/v) ethanol and 1 80 millimolar sodium acetate was 
added to each mixture to halt the DNase reaction and to precipitate the nucleic acids present in 
the mixture. 

25 [0168] DNase-treated nucleic acids were resuspended in 4 microliters of a solution 

comprising 80%» (v/v) formamide, 10 millimolar NaOH, 1 millimolar EDTA, and 0.1 % (w/v) 
bromophenol blue. The suspension was heated at 90''C for five minutes and was applied to a 
gel comprising 8% (w/v) polyacrylamide (29:1 ratio of acrylamide:bis-acrylamide), 90 
millimolar tris-borate buffer (pH 8), and 2 milUmolar EDTA. Following electrophoresis for 2 

30 hours at 200 volts, each gel was dried and imaged on a phosphoimaging device. Individual 
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bases of the 81 -base pair DNA substrates were identified by Maxam-Gilbert sequencing 
reactions performed as described (Ausubel et al., 1994, Current Protocols in Molecular Biology, 
8th Ed., Janssen, ed., John Wiley & Sons, Inc., Boston). 

5 ATPase assays 

[0169] ATPase activity was measured in a reaction mixture comprising 20 microliters of 
Buffer P, 500 micromolar non-labeled ATP (except where indicated), and 16,5 nanomolar [" 
^^P]gamma-ATP. Buffer P comprised 40 millimolar HEPES which had been adjusted to pH 
7.8 using NaOH, 75 millimolar NaCl, 10 millimolar MgCl2, 1.75 millimolar DTT, and 0.075 
10 millimolar EDTA, and 15 % (v/v) glycerol. Steady state reaction measurements were made 
O using 60 nanomolar hMSH2:hMSH6 heterodimer and either 240 nanomolar homoduplex 39- 

=;p base pair DNA substrate or 240 nanomolar G/T mismatched 39-base pair DNA substrate. 

'|i Reaction mixtures were incubated at 37''C for thirty minutes, and the reaction was stopped by 

addition of 400 microliters of a solution comprising 10% (w/v) activated charcoal (Sigma 
hQ 15 Chemical Co., St. Louis, MO) and 1 millimolar EDTA, Charcoal was pelleted by centrifuging 
-rn the mixture at 10,000 rotations per minute for ten minutes. The P content of duplicate 100 

5^1 microliter aliquots of the supernatant was assessed by liquid scintillation. 

Ili [0170] Initial velocity measurements were made by incubating the hMSH2:hMSH6 

1^^; heterodimer for ten minutes at 25''C in a reaction mixture comprising one volume Buffer P 

20 containing no MgCl2, 200 nanomolar non-labeled ATP, and 16.5 nanomolar [-^^P]gamma- 

ATP, To start the reaction, an equal volume of buffer P comprising 20 millimolar MgCl2 and 1 
millimolar non-labeled ATP was mixed with the reaction mixture, which raised the MgCl2 and 
ATP concentrations to 10 millimolar and 500 micromolar, respectively. Aliquots were 
removed at selected times and electrophoresed as described herein. A control aliquot was 
25 removed and prepared for electrophoresis prior to addition of the MgCl2-containing Buffer P to 
the reaction mixture. 

ADP exchange assays 

[0171] The ADP-ATP exchange rate was determined in a reaction mixture which comprised 
30 Buffer Q, 2.3 micromolar [^H]-ADP, and 60 nanomolar hMSH2:hMSH6 heterodimer. Buffer 
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Q comprised 25 millimolar HEPES which had been adjusted to pH 7.8 using NaOH, 75 
millimolar NaCl, 10 millimolar MgCl2, 1 millimolar DTT, and 15% (v/v) glycerol. This 
reaction mixture was incubated for ten minutes at room temperature. 240 nanomolar G/T- 
mismatched 39-base pair DNA substrate was added to the reaction mixture, and the incubation 
5 was continued for an additional ten minutes. The final volume of the reaction mixture was 1 0 
microliters. The order of addition of DNA and ADP did not affect the kinetic results obtained 
using this assay. An equal volume Buffer Q comprising 1 millimolar non-labeled ATP was 
then added to the reaction mixture. Reactions were incubated at 25°C for a selected time and 
then halted by diluting the reaction mixture with 4 milliliters of an ice-cold stop buffer 
10 comprising 25 millimolar HEPES which had been adjusted to pH 7.8 using NaOH, 100 
G millimolar NaCl, and 10 millimolar MgCU. 

^2 [0172] Each halted reaction mixture was immediately filtered on a HAWP nitrocellulose 

membrane (Millipore, Bedford, MA) and washed thrice with 4 milliliters of the ice-cold stop 
buffer. Each filter was air dried and incubated overnight in a standard scintillation cocktail, 
^fi 15 Radioactivity retained on the filters was quantified using a Beckman scintillation counter. A 
G control reaction mixture was prepared by not adding the Buffer Q comprising 1 millimolar non- 

|i labeled ATP to the reaction mixture. The amount of [^H]-ADP retained on the membrane to 

^ which the control reaction mixture was applied was considered to correspond to the amount of 

|i radioactivity retained when 100% of the complex had [^H]-ADP bound thereto. 

20 

Thin Layer Chromatography (TLC) Analysis 

[0173] TLC was used to determine the composition of an ATPase reaction mixture which 
was prepared as described herein in the presence of the G/T-mismatched 39-base pair DNA 
substrate, 15 micromolar ATP, and 0,01 micromolar [^^P] alpha- ATP and which was permitted 
25 to react for twenty minutes at 37°C. TLC was performed as previously described (Fishel et al., 
1988, Proc. Nati. Acad. Sci. USA 85:36-40). 

[0174] The results of the experiments presented in this Example are now described. 
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Overexpression and purification of the hMSH2-hMSH6 protein complex 
[01 75] hMSH2 and hMSH6 proteins were overexpressed in insect cells using a dual 
expression baculovirus vector, as assessed by the SDS-PAGE analysis of proteins obtained 
from cell extract. Co-expression of hMSH2 and hMSH6 proteins resulted in formation of a 

5 completely soluble hMSH2;hMSH6 heterodimer. Independent expression of either protein 
alone resulted in formation of a substantial amount of insoluble protein product, hMSH2 and 
hMSH6 likely exist together as a highly stable complex in vivo, as judged by the results 
obtained in the experiments described in this Example, the ability of investigators to co-purify 
these two proteins from human cells (Drummond et al, 1995, Science 268:1909-1912), and the 

10 ability of these two proteins to interact in vitro (Acharya et al, 1996, Proc, Natl. Acad. Sci. 
USA 93:13629-13634). 

[0176] Purification of hMSH2 and hMSH6 from insect cells indicated that a stable 
heterodimer of the two proteins had been formed. Quantitative densitometry of Coomassie- 
stained products consistently revealed that the hMSH2 and hMSH6 subunits were present in an 
15 equimolar ratio, as was observed with the yeast MSH2:MSH6 protein complex (Alani et al,, 

1997, Mol. Cell Biol. 17: 2436-2447), The purification methodology described herein yielded a 
protein preparation which was more than 95% homogeneous, which exhibited high 
MSH2/MSH6 activity, and which appeared to be free of any contaminating nucleic acid or 
nucleotide. 

20 

G/T mismatch binding by hMSH2:hMSH6 is a model for mismatch recognition 
[0177] The hMSH2:hMSH6 heterodimer has been demonstrated herein and by others to 
bind to the eight possible mismatched nucleotide combinations, as well as to a subset of single 
nucleotide insertion/deletion mismatches (Acharya et al., 1996, Proc. Natl, Acad. Sci. USA 

25 93:13629-13634; Drummond et al., 1995, Science 268:1909-1912; Hughes et al., 1992, J. Biol. 
Chem. 267:23876-23882). The G/T mismatch was chosen as a model for quantitative analysis 
of hMSH2:hMSH6 mismatch binding because of its apparently intermediate-to-high 
recognition specificity, as indicated, for example, by the data presented in Figures lA-lD. 
[0178] The apparent dissociation constant (Kj) was determined in a simple buffer system 

30 comprising neither an adenine nucleotide nor magnesium using the homologous 8 1 -base pair 
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DNA substrate and the G/T-mismatched 81 -base pair DNA substrate described herein. Results 
obtained using both gel shift assays, as depicted in Figure 1 A, and DNase footprint assays, as 
depicted in Figure IC, indicated that of the hMSH2:hMSH6 heterodimer for G/T 
mismatches was 20 ± 5 nanomolar. Binding of non-mismatched DNA to the heterodimer was 
5 not saturable, even at homoduplex concentrations greater than 400 nanomolar. 

[0179] The binding of the hMSH2:hMSH6 heterodimer to a G/T mismatch is at least ten 
times more efficient than binding of hMSH2 alone to the G/T mismatch (Fishel et al., 1994, 
Science 266:1403-1405; Fishel et al., 1994, Cancer Res. 54:5539-5542; Mello et al., 1996, 
Chemistry & Biology 3:579-589). This observation indicates that formation of the 
10 hMSH2:hMSH6 heterodimer enhances both the affinity and the specificity of hMSH2-binding 

0 to mismatched DNA (Acharya et al., 1996, Proc. Natl. Acad. Sci. USA 93:13629-13634). 

[0180] Gel mobility shift assays performed using the G/T-mismatched 39-base pair DNA 

'P substrate described herein or using the G/T-mismatched 81 -base pair DNA substrate and a 

'^0 buffer comprising 2 millimolar MgCl2 yielded results similar to those shown in Figure 1 A. 

£ 15 The hMSH2:hMSH6 heterodimer appears to bind G/T mismatched DNA in muhiple forms 

m which are differentiable by gel mobility shift assay, 

[0181] DNase footprint analysis of hMSH2:hMSH6 heterodimer binding to the G/T- 

W mismatched 8 1 -base pair DNA substrate indicated that the complex asymmetrically protects 

|5 abovit 25 nucleotides on both strands of the substrate. As shown in Figure IC, there appeared to 

20 be two domains protected by the complex from cleavage by DNase, One domain appeared to 
be centered on the G/T mismatch in the substrate. The other domain was adjacent the domain 
centered on the G/T mismatch and was separated from that domain by a single DNase-sensitive 
nucleotide. These data axe qualitatively similar to those observed in similar experiments using 
the E. coH and T. aquaticus MutS proteins (Su et al, 1986, Proc. Natl. Acad. Sci., USA 
25 83:5057-5061; Su et al., 1988, J. Biol. Chem. 263:6829-6835; Biswas et al, 1997, J. Biol. 
Chem. 272: 13355-13364). 

[0182] Although a shifted complex could be detected by gel mobility shift assay using 
homoduplex DNA, no specific DNase footprint could be identified, as indicated by the data 
presented in Figure ID. Lack of saturatability and lack of a specific footprint are consistent 
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with the abiUty of the hMSH2:hMSH6 heterodimer to weakly and non-specifically associate 
with homoduplex DNA. 

[01 83] Shifted complexes formed between the heterodimer and homoduplex DNA and 
those formed between the heterodimer and G/T-mismatched DNA migrated differently in gel 
mobility shift assays, as shown in Figures 1 A and IB. Homoduplex DNA-bound heterodimer 
(designated 'NS' for 'non-specific' in Figure IB) migrated more slowly than G/T-mismatched 
DNA-bound heterodimer (designated 'S' for 'specific' in Figure 1 A). These results suggest that 
homoduplex DNA-bound heterodimer adopts a different conformation than mismatched DNA- 
bound heterodimer. Alternatively, there may have been a greater quantity of the heterodimer 
bound to homoduplex DNA than to mismatched DNA. 

[0184] When the homoduplex 39-base pair DNA substrate described herein was contacted 
with the heterodimer, no NS product was observed in the gel mobility shift assay. The DNA 
length dependence of NS product formation may result if a minimum number of base pairs were 
necessary to assume an alternative DNA and/or hMSH2- or hMSH6-protein conformation or to 
bind multiple hMSH2:hMSH6 heterodimers. 

[0185] These results demonstrate the high specificity of heterodimer binding to the G/T- 
mismatched 81 -base pair DNA substrate. The binding was found to be quantitatively similar by 
both gel mobility shift and footprint analysis. In addition, a low level non-specific binding to 
duplex DNA was observed and found to be easily distinguished via its altered mobility using 
gel mobility shift analysis. 

The hMSH2:hMSH6 heterodimer converts ATP 
to ADP in the presence of mismatched DNA 

[0186] Both bacterial and yeast MutS homologs have been shown to possess intrinsic low- 
25 level ATPase activity (Alani et al, 1997, Mol. Cell Biol. 17: 2436-2447; Chi et al, 1994, J. 

Biol. Chem. 269: 29993-29997; Chi et al., 1994, J. Biol. Chem. 269:29984-29992; Habe et al., 
1988, J. Bacteriol. 170:197-202). There are conflicting reports regarding the capacity of 
mismatched heteroduplex and/or homoduplex DNA to stimulate this intrinsic ATPase activity 
(Alani et al., 1997, MoL Cell Biol. 17: 2436-2447; Chi et al., 1994, J, Biol. Chem. 269: 29993- 
30 29997; Chi et al, 1994, J. Biol. Chem. 269:29984-29992). 
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[0187] It was demonstrated in the experiments described in this Example that the 
hMSH2:hMSH6 heterodimer possesses intrinsic DNA-dependent ATPase activity that is 
dependent upon the presence of magnesium as a cofactor. Saturation of the ATPase activity by 
hMSH2:hMSH6 heterodimer which was observed at protein concentrations above 0.6 
5 micromolar was likely the result of a limiting amount of DNA, which was use at a fixed 
concentration of 240 nanomolar in the assay. 

[0188] Thin layer chromatography revealed that hMSH2:hMSH6 heterodimer ATPase 
activity uniformly converts ATP to ADP and inorganic phosphate. Using Lineweaver-Burk 
analysis and Eadie-Hofstee analysis, it was determined that hMSH2:hMSH6 heterodimer 
10 ATPase is most active in the presence of a G/T mismatch. The value of k^^^ using ATP and 
O G/T-mismatched DNA as sxibstrates was about 26 minute"-^ . The value of K^^ using ATP and 

CSi G/T-mismatched DNA as substrates was about 46 micromolar. hMSH2:hMSH6 heterodimer 

% ATPase is substantially less active in the presence of homoduplex DNA. The value of k^^ 

:;i using ATP and G/C-mismatched DNA as substrates was about 7.4 minute' ^ The value of 

15 using ATP and G/C-mismatched DNA as substrates was about 23 micromolar. hMSH2:hMSH6 
heterodimer ATPase is substantially inactive in the absence of DNA. The value of k^^ using 
|i ATP alone as a substrate was about 0.9 minute" . The value of using ATP alone as a 

^ substrate was about 10 micromolar. 

|i [0189] ATPase activity stimulation was the same regardless of whether the homoduplex 

20 DNA had a length of 39 base pairs, 81 base pairs or 2,900 base pairs, and was also the same 
regardless of whether the mismatched DNA had a length of 39 base pairs or 81 base pairs. 
These results indicated that hMSH2:hMSH6 heterodimer ATPase activity is not dependent 
upon DNA length. 

[0190] It was observed that k^^^ using ATP alone as a substrate was lower than k^^^ using 
25 ATP and homoduplex DNA as a substrate and this value was lower than k^^^ using ATP and 

mismatched DNA as substrates. However, for ATP in the absence of DNA was lower than 
for ATP in the presence of homoduplex DNA, and this value was lower than BC^ for ATP 

in the presence of mismatched DNA. These observations indicated that although the rate of 

hydrolysis is increased in the presence of a mismatch, the affinity for ATP is decreased. These 
30 results are qualitatively similar to the phenomenon of uncompetitive inhibition which may be 
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ascribed to the presence of independent and separate binding sites as well as a ping-pong 
binding mechanism (Dixon et al., 1979, Enzymes, 3rd Ed., Academic Press, New York). 
[0191] Single-stranded DNA (ssDNA) was determined to be the most potent stimulator of 
hMSH2:hMSH6 heterodimer ATPase activity. Thus, the conflicting reports in the prior art 
5 regarding ATPase activities of related MutS homologues may have resulted from contamination 
by ssDNA leached from columns used to purify the homologues and/or by non-aimealed 
ssDNA that remained following preparation of oligonucleotide substrates. 



hMSH2:hMSH6 heterodimer mismatch binding is abolished 
10 in the presence of ATP in the absence of hydrolysis of ATP 
O [0192] Both bacterial and eukaryotic MutS homologs have been reported to fail to form a 

rp specific complex with a mismatched oligonucleotide in the presence of ATP (Drummond et al., 

1995, Science 268:1909-1912; HaberetaL, 1991,EMBO. J. 10:2707-2715; Alaai et al., 1997, 
j Mol. Cell Biol. 17: 2436-2447; Grilley et al., 1989, J. Biol. Chem. 264:1000-1004). Before the 

15 present invention, it was believed that ATP hydrolysis catalyzed by MutS protein drove 

translocation of the protein along a duplex DNA strand, causing dissociation of the protein from 

■fi any mismatch with which it might be associated (Grilley et al., 1989, J. Biol. Chem. 264:1000- 

W 

IV 1004; Modrich, 1989, J. Biol. Chem. 264:6597-6600; Modrich, 1991, Annu. Rev. Genet. 

P 25:229-253; Modrich et al., 1996, Annu. Rev. Biochem. 65:101-133; Allen et al., 1997, EMBO 

20 J, 16:4467-4476). The suggestion that ATP hydrolysis was required for the mismatch release 
was based on the observation by others that adenylyl-imidodiphosphate (AMP-PNP), a non- 
hydrolyzable analog of ATP, does not alter mismatch binding (Alani et al., 1997, Mol. Cell 
Biol. 17: 2436-2447; Drummond et al, 1995, Science 268:1909-1912), 
[0193] The experiments described in this Example establish that the hMSH2:hMSH6 

25 heterodimer is released from a G/T-mismatched DNA substrate in the presence of ATP, as 

indicated in Figures 2A and D. The value of IC^q (the concentration of ATP required to cause 
release of 50% of a population of heterodimers from a G/T-mismatched DNA substrate) was 
determined to be approximately 3 micromolar. Adenosine-5'-0-3-thiotriphosphate (ATP- 
gamma-S), a poorly-hydrolyzable ATP analog (Sekimizu et al., 1987, Cell 50:259-265; Yu et 

30 al, 1992, J. Mol. Biol. 225:193-216), caused a similar release of the hMSH2:hMSH6 
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heterodimer from a G/T-mismatched DNA substrate, the value of IC^q for ATP-gamma-S 
being 3 micromolar, as indicated in Figures 2B and 2D. Addition of ADP to the mismatch 
binding reaction mixture resulted increased binding affinity of the heterodimer for the G/T- 
mismatched DNA substrate, as indicated in Figures 2C and 2D. 
5 [01 94] The results presented in this Example demonstrate that release of the 

hMSH2:hMSH6 heterodimer from a G/T-mismatched DNA substrate with which it is 
associated is not dependent upon ATP hydrolysis. This conclusion follows from the 
observations that release of the complex occurs in the absence of exogenous magnesium and 
that release of the complex from the substrate is effected by the presence of ATP-gamma-S 
10 regardless of the presence or absence of magnesixmi. The presence of magnesium is absolutely 

O required for hMSH2:hMSH6 heterodimer-dependent ATP hydrolysis. Furthermore, NS 

m binding of hMSH2 to homoduplex DNA is insensitive to the addition of exogenous ATP. 

't^ Thus, the presence of ATP affects only the ability of the hMSH2:hMSH6 heterodimer to bind to 

mismatched DNA substrates. Binding of the heterodimer to homoduplex DNA is not affected 

J 15 by ATP. 

f n [0195] The presence of 2'-deoxy adenosine triphosphate (dATP) to the mismatch binding 

S^H reaction mixture caused release of a G/T-mismatched DNA substrate from the hMSH2:hMSH6 

tu heterodimer, similarly to the release caused by the presence of ATP or ATP-gamma-S in the 

mixture, as illustrated in Figure 3. No other nucleotide was found to stimulate the release of the 
20 G/T-mismatched DNA substrate from the heterodimer. 

[0196] Neither of two other non-hydrolyzable analogs of ATP, namely AMP-PNP and 
adenyl-(beta-, gamma-methylene)-diphosphonate (AMP-PCP), caused release of the 
heterodimer from the substrate. Equilibrium competition between each of these two analogs 
and ATP suggested that they bind to the heterodimer and caused effects similar to those caused 
25 by ADP. Failure of AMP-PNP and AMP-PCP to stimulate release of mismatched DNA from 
the heterodimer demonstrated that the interaction between the beta-gamma bridging oxygen 
atom of ATP and either the heterodimer or the mismatched DNA substrate bound to the 
heterodimer are for release of the substrate from the heterodimer. Enzyme-nucleotide 
triphosphate complexes in which the beta, ganrnia oxygen atom interacts with either the enzyme 
30 or its substrate are not unknown. For example, the Ras GTPase binds GTP, and donation of a 
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hydrogen bond to the beta-gamma bridging oxygen of GTP is thought to contribute to catalysis 
by the enzyme (Maegley et ah, 1996, Proc. Nath Acad. Sci. USA 93:8160-8166). 
[01 97] The resuhs presented in this example demonstrate that the hMSH2:hMSH6 
heterodimer binds to a mismatched DNA substrate in the presence of ADP, and that the 
5 substrate is released from the heterodimer in the presence of ATP or dATP. Because ATP- 
induced release of the substrate from the heterodimer does not require magnesium and is 
similarly induced by ATP-gamma-S, ATP hydrolysis is not implicated in substrate release. As 
increasing amounts of ATP or ATP-gamma-S were added to the mismatch binding reaction 
mixture, approximately 15% of S-shifted material gradually became re-associated with the 
10 DNA in the form of a NS-shifted heterodimer, as indicated in Figures 2A and 2B. This fraction 

0 was consistent with the amount of NS binding observed for homoduplex DNA at this 
concentration of the heterodimer, as indicated in Figure 2B. These results indicated that 

't^: hMSH2:hMSH6 heterodimers which dissociated from mismatched substrate could re-associate 

' fi with either the duplex arms or the ends of the substrate. 

1 15 

U ATP hydrolysis catalyzed by the hMSH2:hMSH6 

heterodimer results in recovery of mismatch 
IIJ binding activity of the heterodimer 

[0198] To determine the role of ATP hydrolysis in mismatch recognition, ATP or ATP- 

20 gamma- S was introduced into a mismatch binding reaction mixture in the absence of 

magnesium. As illustrated in Figures 2A, 2B, 2D^ and 3, introduction of either compound 
resulted in release of the hMSH2:hMSH6 heterodimer from the mismatched DNA substrate in 
the absence of hydrolysis of the compound. In experiments presented in Figure 4A, magnesium 
was added to each reaction mixture, which was maintained at 37°C, and the G/T mismatch 

25 binding activity of hMSH2:hMSH6 heterodimer was followed over time, with time zero 

corresponding to the time at which magnesium was added. In the reaction mixture comprising 
ATP, mismatched DNA substrate binding activity of the heterodimer was initially low, nearly 
70% of this activity was recovered after ten minutes of incubation at 37°C, and more than 95% 
of the activity was recovered fifty minutes after magnesium addition. Substantially less (about 

30 22%) of mismatched DNA substrate binding activity was recovered in the reaction mixture to 
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which ATP-gamma-S was added. These results demonstrated that efficient hydrolysis by the 
heterodimer is essential for recovery of the heterodimer's mismatch binding activity. 
Substitution of ATP with dATP produced quantitatively similar recovery of mismatch binding 
activity (i.e. >95% recovery) following incubation at 37°C. Taken together, these results 
demonstrated that the intrinsic ATPase activity associated with the human hMSH2:hMSH6 
heterodimer is required for recovery from mismatch-release induced by binding to and/or 
exchange with, ATP or dATP. 

[0199] Complete recovery of mismatched DNA substrate binding activity of the 
hMSH2:hMSH6 heterodimer, which activity was abolished by exposing the heterodimer to 
ATP, was achieved by increasing the ratio of the concentration of ADP to the ratio of ATP in 
the solution in which the heterodimer was suspended, as indicated in Figure 4B In this 
competition experiment, mismatch binding reaction mixtures comprised 0.2 millimolar ATP, 1 
millimolar MgCl2, and a selected concentration of ADP from 0 to 3.2 millimolar. It was 
determined that a 2- to 3 -fold excess of ADP to ATP resulted in reversal of approximately half 
of the release of substrate by the heterodimer caused by the presence of ATP. Approximately 
complete reversal of substrate release caused by the presence of ATP was achieved by 
providing a 16-fold excess of ADP to the mixture. A qualitatively similar, though functionally 
opposite, result was obtained when the competition was performed by including a fixed 
concentration of ADP in the reaction mixture and adding various concentrations of ATP. Thus, 
ADP and ATP are nearly equivalent in their ability to associate with the hMSH2:hMSH6 
heterodimer, but the two nucleotides elicited opposite functional effects on mismatch binding. 
ATP caused release of substrate bound to the heterodimer, and ADP induced binding of the 
substrate to the heterodimer. Therefore, ADP is responsible for mismatch binding recovery. 
[0200] Taken together, these observations support the conclusion that the hMSH2:hMSH6 
heterodimer functions as a molecular switch, wherein the ATP- (or dATP-) bound heterodimer 
is "OFF" (i.e. unable to associate with a mismatched DNA substrate with which it is not already 
associated) and the ADP-bound heterodimer is "ON" (i.e. able to associate with a mismatched 
DNA substrate with which it is not already associated). A model of the role of the 
hMSH2:hMSH6 heterodimer is illustrated in Figure 7. 
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[0201] ATP hydrolysis and ADP-ATP exchange determine mismatch binding functions of 
the hMSH2:hMSH6 heterodimer 

[0202] Steady-state analysis of an enzyme having ATPase activity reflects the rate-limiting 
step of the reaction, which can be either gamma-phosphate hydrolysis or adenine nucleotide 
5 exchange. To understand the mechanism of the ATPase activity exhibited by the 

hMSH2:hMSH6 heterodimer and to further define the rate-limiting steps, both hydrolysis and 
nucleotide exchange steps were directly examined. 

[0203] Initial rate (i.e. single-turnover) analysis of an enzyme which exhibits ATPase 
activity involves direct examination of the rate of gamma-phosphate hydrolysis, and was 
10 performed using a method which is similar to that used for the examination of regulators of G- 
Q protein signaling (RGS; Dohlman et al., 1997, J. Biol. Chem. 272:3871-3874). In these 

, experiments, 0.2 micromolar [ P]gamma-ATP was contacted with hMSH2:hMSH6 

1 ^7 

heterodimer in the absence of magnesium, yielding a heterodimer having a [ P] gamma- ATP 

molecule bound thereto. At a selected time, magnesium and an excess of non-labeled ATP 
Cq 15 were added to the reaction mixture, and the rate of a single-round of gamma-phosphate 

hydrolysis was assessed. Subsequent rounds of hydrolysis were undetectable because the ATP 
JB hydrolyzed during those rounds was not labeled. Because the calculated K^^ for ATP at 37°C 

til was in excess of 20 minute , and because this rate was above the limit of detection of this 

Q methodology, these initial rate experiments were performed at 20°C. It was determined that the 

20 hMSH2:hMSH6 heterodimer rapidly hydrolyzed ATP in either the presence or the absence of 

DNA. These results indicated that gamma-phosphate hydrolysis was not the rate limiting step 

in the steady-state ATP hydrolysis by the heterodimer. 

[0204] The extent of ATP hydrolysis which could be detected was equivalent to the total 
number of hMSH2:hMSH6 heterodimers which could be bound to P-labeled ATP prior to the 

25 addition of magnesium. The maximal extent of detectable ATP hydrolysis was determined to 
depend on the amount of the G/T-mismatched DNA substrate present in the reaction mixture 
during binding of labeled ATP to the heterodimer, as indicated in Figures 5 A and 5B. When 
the concentration of the G/T-mismatched DNA substrate in to the reaction mixture exceeded the 
apparent for G/T-mismatched DNA substrate (i.e. about 20 nanomolar), the maximal extent 

30 of ATP hydrolysis decreased, as indicated in Figure 5B. This observation indicated that 
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binding of the hMSH2:hMSH6 heterodimer to a mismatched DNA molecule prior to binding of 
ATP to the heterodimer inhibits binding of ATP to the mismatched DNA-bound heterodimer. 
This observation is consistent with the pseudo-uncompetitive behavior deduced in the steady- 
state ATPase activity experiments described herein (Dixon et al., 1979, Enzymes, 3rd Ed., 
5 Academic Press, New York). 

[0205] Adenine nucleotide exchange was assessed using a method similar to that used for 
guanine nucleotide exchange experiments involving G proteins. In these studies, [ H]-ADP 
was contacted with hMSH2:hMSH6 heterodimer in the presence of magnesium, yielding [ H]- 
ADP-bound heterodimer. At a selected time, an excess of non-labeled ATP was added to the 
10 reaction mixture, and the amount of ADP that remained bound to the heterodimer was assessed 
^i^^ at selected times. 

,"5? [0206] In the absence of DNA, incomplete ADP nucleotide exchange was observed during 

a 1 5 minute reaction period. The half-life of the ADP-bound heterodimer was greater than 
eight hundred seconds. These results clearly suggest that in the absence of DNA, replacement 
15 of ADP by ATP is the rate limiting step for the hMSH2:hMSH6 heterodimer ATPase activity. 

J.^ [0207] In the presence of G/T-mismatched DNA substrate, nucleotide exchange was 

significantly more rapid, the half^life of the ADP-bound heterodimer being less than two 

ilj seconds. Thus, it was demonstrated that binding of the heterodimer to a G/T-mismatched DNA 

y, substrate stimulated replacement of the labeled ADP molecule originally bound to the 

20 heterodimer by a non-labeled ATP molecule. 

[0208] Taken together vidth the results obtained from the single turnover hydrolysis 
experiments described herein, these observations indicated that in the absence of mismatched 
DNA, the hMSH2:hMSH6 heterodimer is capable of a single ATP hydrolysis reaction that 
yields an ADP-bound heterodimer. While in the ADP-bound form, the heterodimer does not 
25 exchange ADP for ATP until the heterodimer binds to a DNA mismatch. By binding to a 
mismatch, the ADP-bound heterodimer becomes competent to exchange ADP for ATP. 
Exchange of ADP for ATP causes release of the heterodimer from the mismatch. ATP-bound 
heterodimer, when no longer bound to mismatched DNA, is capable of catalyzing ATP 
hydrolysis, yielding ADP-bound heterodimer, which is competent to bind to a DNA mismatch. 
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These results indicate that the hMSH2:hMSH6 heterodimer is a molecular switch controlled by 
the phosphorylation state of the adenine nucleotide bound thereto. 

Release of the hMSH2:hMSH6 heterodimer from a 
5 G/T-mismatched DNA substrate may occur by dissociation 

[0209] Prior art models of mismatch recognition by MutS homologs implicated ATP- 
dependent translocation and/or treadmilling along DNA as a mechanism for association and 
dissociation of the homolog with a DNA mismatch (Modrich, 1989, J. BioL Chem. 264:6597- 
6600; Modrich, 1991, Annu. Rev. Genet. 25:229-253; Modrich et ah, 1996, Annu. Rev. 
10 Biochem. 65:101-133; Allen et ah, 1997, EMBO J. 16:4467-4476). Common to all of these 
prior art models is a postixlated time-dependent unidimensional homolog displacement 
7^ mechanism which occurs whether the homolog is bound to duplex DNA or mismatched DNA. 

In contrast, a simple dissociation mechanism would exhibit rapid and two-dimensional 
hQ displacement of the homolog from duplex DNA or mismatched DNA. 

Ifi 15 [0210] The ability to distinguish NS and S electrophoretic bands corresponding to the 

homologous 81 -base pair DNA substrate-bound hMSH2:hMSH6 heterodimer and the G/T- 
133 mismatched 81 -base pair DNA substrate-bound heterodimer, as illustrated in Figure 2 A, 

|tj provided an opportunity to examine the dissociation mechanism of the heterodimer from the 

■^2 G/T-mismatched DNA substrate, as well as from homoduplex DNA. In these experiments, the 

20 G/T-mismatched DNA substrate was bound to the heterodimer, and an excess of an unlabeled 
competitor DNA or an excess of ATP, or both, was added to the mixture. If a tracking or 
sliding mechanism of the prior art were operable for heterodimer dissociation, it would be 
expected that a time-dependent loss of the S shifted electrophoretic band of G/T-mismatched 
DNA substrate-bound complex would be observed, and that a coincident gain of the NS 
25 electrophoretic band would be observed. If a simple dissociation mechanism were operable for 
heterodimer dissociation, it would be expected that loss of the S shifted band would be 
observed without any coincident increase in the intensity of the NS shifted band because the 
vast excess of unlabeled homoduplex DNA would preclude secondary reassociation of the 
complex with the arms or ends of the labeled G/T-mismatched DNA substrate. One potential 
30 complication would be if the amount of time required for heterodimer enables diffusion of the 



-67- 



dimer to a different position on the DNA substrate were nearly the same as the time which 
would be required for simple dissociation. 

[021 1] Three experiments were performed to determine the mechanism of hMSH2/hMSH6 
protein complex dissociation from a labeled 81 -base pair G/T-mismatched DNA substrate. The 
5 results of these experiments are illustrated in Figure 6. 

[0212] In the first experiment, the stability of G/T-mismatched DNA substrate-bound 
hMSH2/hMSH6 complex was assessed by exposing the mismatched substrate-bound complex 
to a 400-fold excess of non-labeled homoduplex DNA and observing the intensities of S shifted 
and NS shifted electrophoretic bands at selected times, as illustrated in Figure 6C. Examination 
10 of the gel depicted in Figure 6C indicated that the S-shifted electrophoretic band, and thus the 
Q amount of the G/T-mismatched DNA substrate-bound hMSH2:hMSH6 heterodimer in the 

, reaction mixture, was not reduced significantly over the ten minute incubation period. Thus, 

the half^life of the G/T-mismatched DNA substrate-bound hMSH2:hMSH6 heterodimer was 
';|J much greater than ten minutes, meaning that the mismatched substrate-bound complex is stable 

15 in the presence of a vast excess of homoduplex DNA. 

[0213] In the second experiment, the stability of G/T-mismatched DNA substrate-bound 

"is?;;? 

hMSH2:hMSH6 heterodimer was assessed by exposing the mismatched substrate-bound 
ii 1j heterodimer to ATP and observing the intensities of S shifted and NS shifted electrophoretic 

I J bands at selected times, as illustrated in Figure 6 A, A gradual decrease in the intensity of the S 

20 shifted electrophoretic band was observed, the band having a half life of about twenty seconds. 
Concurrently with the decrease in the intensity of the S shifted electrophoretic band, a gradual 
but not quantitative increase in the intensity of the NS-shifted electrophoretic band was 
observed. This observation indicated that ATP induced a time-dependent reduction of specific 
binding of the hMSH2:hMSH6 heterodimer to the mismatched DNA substrate and that at least 
25 a portion of the heterodimer reassociated with the mismatched DNA substrate in a non-specific 
manner. However, this experiment did not distinguish between the tracking/sliding or simple 
dissociation and reassociation mechanisms. 

[0214] In order to attempt to distinguish between translocation and simple dissociation and 
reassociation, a third experiment was performed. In this experiment, the stability of G/T- 
30 mismatched DNA substrate-bound hMSH2:hMSH6 heterodimer was assessed by exposing the 
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mismatched substrate-bound heterodimer to both ATP and a 400-fold excess of non-labeled 
homoduplex DNA and observing the intensities of S shifted and NS shifted electrophoretic 
bands at selected times (Figure 6B). As in the second experiment, a gradual decrease in the 
intensity of the S shifted electrophoretic band was observed, the half-life of the band again 
5 being about twenty seconds. This observation was consistent with ATP induction of 

dissociation of the heterodimer from the mismatched DNA substrate. However, under these 
conditions, no increase in the intensity of the NS electrophoretic band was observed. Together, 
these observations suggest that in the presence of excess non-labeled homoduplex DNA, the 
dissociation of the heterodimer from mismatched DNA might not proceed through the product 
10 corresponding to the NS electrophoretic band, but instead may be instantaneous and 
irreversible. 

[0215] When excess non-labeled homoduplex DNA was added to the homologous 81 -base 
'^^^ pair DNA substrate, the NS electrophoretic band associated with the product formed by 

=•0 contacting the heterodimer with DNA substrate, as indicated in Figure IB, for example, could 

:|3 15 be detected, as indicated in Figure 6D, This observation indicated that, even at 4°C, the product 
% corresponding to the NS band was exceedingly unstable and that the level of hMSH2:hMSH6 

W heterodimer which remained associated with the DNA substrate was less than the lower limit of 

lU accurate quantitation using gel shift analysis. 

20 The hMSH2:hMSH6 heterodimer acts as a 
molecular switch in mismatch recognition 

[0216] The discovery that the hMSH2:hMSH6 heterodimer is a novel molecular switch 
which is activatable by ADP was made by reconciling numerous observations described herein. 
These observations are summarized as follows, ADP and ATP have opposing effects on the 

25 role of the hMSH2:hMSH6 heterodimer in mismatched DNA binding. Dissociation of 
mismatched DNA fi*om the hMSH2:hMSH6 heterodimer is not dependent upon ATP 
hydrolysis. Hydrolysis of ATP by the hMSH2:hMSH6 heterodimer results in recovery of the 
ability of the heterodimer to associate with mismatched duplex DNA. gamma-Phosphate 
hydrolysis is not the rate limiting step of ATPase activity catalyzed by the of the heterodimer. 

30 Displacement of ADP by ATP is the rate limiting step of ATPase activity catalyzed by the 
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]iMSH2:hMSH6 heterodimer. Displacement of ADP from the of the heterodimer by ATP is 
accelerated in the presence of mismatched duplex DNA, but hydrolysis of the gamma- 
phosphate bond is not accelerated. ATP-dependent release of mismatched DNA from the 
hMSH2:hMSH6 heterodimer occurs rapidly, possibly by simple dissociation or by rapid ATP- 
5 hydrolysis-independent diffusion to a free end of the DNA. These observations indicate that 
gamma-phosphate hydrolysis and displacement of ADP by ATP determine whether the 
hMSH2:hMSH6 heterodimer binds to or is released from mismatched duplex DNA, as 
illustrated in Figure 7, Recognition of the hMSH2:hMSH6 heterodimer as a molecular switch 
supports the conclusion that it is a trigger for determining the timing of subsequent excision 
1 0 repair-related events . 

^ Implications for mismatch repair 

[0217] The number of hMSH2:hMSH6 heterodimers in the nucleus of a proliferating cell 
'iQ has been estimated to exceed one thousand (Drummond et aL, 1995, Science 268:1909-1912; 

J 15 Wilson et al., 1995, Cancer Res. 55:5146-5150; Meyers et aL, 1997, Cancer Res. 57:206-208). 

The calculated K j of the heterodimer for mismatched DNA (i.e. about 20 nanomolar) implies 
W that a single mismatched nucleotide in a human cell is hkely to be efficiently recognized and 

ly bound with high affinity by an hMSH2:hMSH6 heterodimer. In the presence of ATP, this high 

g affinity binding is nearly irreversible. Thus, dissociating the heterodimer from mismatched 

20 DNA in order to allow a subsequent excision repair event to proceed may be more difficult than 

binding the heterodimer to the mismatch. 

Generality of MutS function 

[0218] The studies described in this Example, which involved the human mismatch binding 
25 reaction catalyzed by the hMSH2:hMSH6 heterodimer, are consistent with genetic studies 
performed in both bacteria and yeast. In those studies, mutation of the adenine nucleotide 
binding and hydrolysis domain(s) resulted in a dominant mutator phenotype (Haber et al., 1991, 
EMBO. J. 10:2707-2715; Wu et al, 1994, J. BacterioL 176:5393-5400; Alani et al., 1997, MoL 
Cell. Biol. 17; 2436-2447). Those studies, combined with the studies described in this 
30 Example, indicate that there may be two opposing functional alterations of MutS homologs that 
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can cause such a dominant mutator phenotype. First, alteration of the abihty of the homolog to 
bind and/or exchange ADP for ATP can cause a dominant mutator phenotype. Second, 
aheration of the abihty of the homolog to hydrolyze ATP can similarly cause such a phenotype. 
Inability of the homolog to bind to ADP or to exchange ADP for ATP would result in a 
permanently mismatched DNA-bound form of the MutS homolog. This form of the homolog 
would exclude the repair machinery from the mismatch site. Inability of the MutS homolog to 
hydrolyze ATP would result in a form of the homolog that would be unable to bind to 
mismatched DNA and which, therefore, would be unable to recruit the cellular mismatch repair 
proteins and factors to the site of the mismatch. Each these conditions would cause an 
increased mutation rate in the organism containing the homolog, as a consequence of the 
organism's depressed ability to repair mismatched DNA (Wu et al, 1994, J. Bacteriol. 
176:5393-5400). 

[0219] Preliminary studies performed using the methods described herein and using 
purified Escherichia coli MutS protein suggest that E. coli MutS also functions as a molecular 
switch, albeit with a more stringent requirement for mismatch-induced nucleotide exchange. 
Therefore, the properties of the MutS homologs hMSH2 and hMSH6, as described herein 
appear to be properties of all MutS homologs, including, but not limited to, E. coli MutS, and 
the human MutS homologs hMSH2, hMSH3, and hMSH6. 

Similarity of the hMSH2:hMSH6 heterodimer to G-protein switches 

[0220] The hMSH2:hMSH6 molecular switch is, in some respects, similar to G-protein 
switches which have been described (Bokoch et al., 1993, FASEB J. 7:750-759). G-proteins 
are known to trigger translocation events associated with protein synthesis (Laalami et al., 
1996, Biochimie 78:577-589; Parmeggiani et al, 1981, Mol. Cell Biochem. 35:129-158), 
cascade events associated with cell signaling (Medema et al, 1993, Grit. Rev. Oncol. 4:615- 
661; WiesmuUer et al., 1994, Cell Signal 6:247-267) and physiological responses to hgand- 
binding by membrane receptors (Spiegel, 1987, Mol. Cell. Endocrinol. 49:1-16). Many G- 
proteins are associated with regulators that stimulate both the GTPase activity of the G-protein 
(Tocque et al, 1997, Cell Signal. 9:153-158) and the exchange of G-protein-bound GDP for 
GTP (Dohlman et al, 1997, J. Biol Chem. 2 72:3871-3874; Quilliam et al, 1995, Bioessays 
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17:395-404). In fact, the Ras G-protein was determined to be unable to catalyze GTP 
hydrolysis because it is unable to exchange GDP for GTP. The discovery of a GTPase 
activating protein (GAP) that stimulated GTP gamma-phosphate hydrolysis, and a guanine 
nucleotide exchange factor (GNEF) that stimulated the exchange of GDP for GTP, provided a 
5 model for regulation of the Ras G-protein switch (Tocque et al, 1997, Cell Signal 9:153-158; 
Dohlman et al., 1997, J. Biol. Chem. 2 72:3871-3874). 

[0221] It has therefore been discovered that protein regulation of the excision-resynthesis 
processes associated with mismatch repair occurs by stimulation of the ATPase activity of the 
hMSH2:hMSH6 heterodimer or of the ability of the heterodimer to exchange ADP for ATP. 
10 The latter stimulation can occur either by stabilizing the ADP-bound form of the heterodimer or 
Q by stimulating exchange of ADP for ATP to effect release of the heterodimer from mismatched 

DNA. It is thought by the inventors that MutL homologs, such as the human MutL homologs, 
hMLH 1 , hPMS 1 , and hPMS2, perform these regulatory functions. 

iM 15 Example 2 

J:,^ A Mouse Construct NuUizygous for both msh2 and p53 and 

J* Methods of Making and Use Thereof 

IIJ [0222] Transgenic mice which are nuUizygous for both Msh2 and p53 have been made, and 

are referred to herein as Msh2"^'p53"^" mice. Other transgenic animals which are nuUizygous 
20 for both Msh2 and p53, and which particularly include mammals, especially including rodents 
such as mice and rats, may be made using methods analogous to those described herein and are 
useful in the screening methods described herein. 

[0223] The development of female Msh2~^'p53"^" mouse embryos is phenotypically 
arrested at approximately the 9.5 day stage, and apoptosis is induced shortly thereafter in the 
25 cells of these embryos. Male Msh2"^"p53"'^" mouse embryos are viable, but succumb to tumors 
significantly earlier than either Msh2"^"p53^^=^ or Msh2^^^p53"^" littermates (i.e. nuUizygous 
Msh2 mice or nuUizygous p53 mice, respectively). Furthermore, the frequency of 
microsatellite instability (MSI) in tumor tissue obtained from Msh2"^"p53"'^" mice is not 
significantly different than the frequency in tumor tissue obtained from Msh2"^"p53'^" mice. 
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Synergism in tumorigenesis and independent segregation of the MSI phenotype suggest that 
Msh2 and p53 are not genetically epistatic. 

[0224] Msh2"^"p53"^' mice are useful as models of disease or disorder states which cannot 
be identified in mice nuUizygous for only one of Msh2 or p53. Furthermore, Msh2"^'p53'^" 
5 mice are useful for identifying compositions which affect the onset or progression of such a 
disease or disorder state. Thus, a Msh2"^'p53'^" mouse is particularly useful as a model system 
for studying multistep tumorigenesis, apoptosis, and aging. 

[0225] The materials and methods used in the experiments presented in this Example are 
now described. 

10 

12 Generation of Msh2'^"p53"^" Mice 

^ [0226] Methods for making heterozygous and nuUizygous Msh2 mice and heterozygous 

'^jLt and nuUizygous p53 mice have been described (de Wind et ah, 1995, CeU 82:321-330; Reitmair 

J etal, 1995, Nature Genet. 11:64-70; Donehower et al, 1992, Nature 356:215-221; Jacks et al., 

J 15 1994, Curr. BioL 4:1-7; Purdie et al., 1994, Oncogene 9:603-609). 

[O227] Mice heterozygous for Msh2 (i.e. Msh2^^'p53^^"^ mice) on a mixed C57BL/6J and 
J 129/Ola background and mice heterozygous for p53 (i.e. Msh2'^^'^p53"^^" mice) on a mixed 

lli C57BL/6J and 129/Sv were mated to produce Fl progeny heterozygous for both genes (i.e. 

Msh2~^^"p53~^^' mice). Heterozygous sibling Fl progeny were intercrossed to produce progeny 
20 nuUizygous for both Msh2 and p53 (i.e. Msh2"^"p53'^" mice). Mice were genotyped using 

Msh2- and p53- specific PCR-based assays, using methods well known in the art. 

Isolation of Genomic DNA 

[0228] Mouse genomic DNA was extracted from ear-notched tissue of mice and from 
25 amniotic tissue of mouse embryos at 9.5, 1 1 .5, or 13.5 days of development, using a QIAamp 
Tissue Kit (Qiagen, Chatsworth, CA) according to the manufacturer's instructions. 
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PCR-based Genotyping of Mice 

[0229] A three-primer assay specific for Msh2 was carried out as described (Reitmair et al, 
1 995, Nature Genet. 1 1 :64-70). A four-primer assay specific for p53 was carried out using 50 
5 nanograms of template DNA in a 50 microliter reaction mixture containing 1 unit of Taq 

polymerase (Fisher Scientific, Malvern, PA) and 100 millimolar each of the following primers, 
each of which is identified with a five digit number and the sequence of each of which is listed: 
10681 (5'-GTGTTTCATT AGTTCCCCAC CTTGAC-3^ SEQ ID NO: 7); 
10480 (5'-ATGGGAGGCT GCCAGTCCTA ACCC-3'; SEQ ID NO: 8); 
10 10588 (5'-GTGGGAGGGA CAAAAGTTCG AGGCC-3'; SEQ ID NO: 9); and 
Q 10930 (5'-TTTACGGAGC CCTGGCGCTC GATGT-3'; SEQ ID NO: 10). 

.J The amphfication reaction involved 35 cycles of amplification (94°C, 15 seconds; 56°C, 30 

seconds; 72''C, 1 minute) using a Perkin-Elmer GeneAmp 9600 thermal cycler. The wild-type 
'lO primers, 10681 and 10480, amplified a product of about 320 base pairs length, and the targeted 

,k 15 allele (i.e. p53") primers, 1 0588 and 1 0930, amplified a product of about 1 50 base pairs length. 
0 [0230] The gender of embryos was determined using primers specific for the Y- 

^ chromosome gene as described (Sah et al, 1995, Nature Genet. 10:175-180). The presence of 

|1J the X-chromosome was confirmed separately in all cases using the following two X- 

chromosome specific primers to amplify the locus DXMIT6: 
20 5'-ACCATTCAAATTGGCAAGG-3' (SEQ ID NO: 1 1); and 
5'-GTGGCTCGAGTTGTTTGCAG-3 ' (SEQ ID NO: 12), 

PGR cycling conditions were as described above for p53 genotyping, except that the annealing 
temperature was 53°C, rather than 56°C. The X-chromosome specific primers amplified a 
product of about 210 base pairs in length. All PGR amplification products were resolved by 
25 electrophoresis on a 2% (w/v) agarose gel alongside a 100 base pair polynucleotide ladder 
standard and were visualized by ethidium bromide staining. 
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Timed Pregnancies 

[0231] Male and female mice having a known Msh2"^^'p53^^", Msh2^^"p53'^\ or Msh2"^" 
p53^^" genotype were mated and each of the females was examined daily for the presence of a 
5 vaginal plug (an indicator of pregnancy which appears at about day 0.5 of embryo 

development). Pregnant females were sacrificed at 13.5 days, at 1 1 .5 days, or at 9.5 days 
gestation. Embryos were dissected out from the pregnant females into Hank's Balanced Salt 
Solution (Gibco BRL, Grand Island, NY) under a dissecting microscope, fixed in 4% (v/v) 
buffered formalin, and documented by photomicrography. Amnion was retrieved from each 
10 embryo, DNA was extracted therefrom, and the sex and genotype of each embryo was 
i|2 determined by PGR. 

Histology 

SO [0232] Tissue specimens were fixed in 10% (v/v) or 4% (v/v) buffered formalin and 

15 embedded in paraffin. Histological analysis was carried out on 3 micrometer-thick sections 
stained with hematoxylin and eosin (H&E), 

lU 

I1J TUNEL Assay 

[0233] Paraffin-embedded tissue sections were de-waxed and rehydrated using a graded 
20 alcohol series, using methods well known in the art. Apoptotic cells and appropriate positive 
and negative control samples were analyzed using the In Situ Cell Detection Kit, AP with 
NBT/BCIP, manufactured by Boehringer Mannheim (Indianapolis, IN), according to the 
manufacturer's instructions. TUNEL-stained tissue sections were analyzed both by 
fluorescence microscopy and light microscopy. 

25 

Kaplan-Meier Survival 

[0234] Kaplan-Meier survival probability was calculated for mice that were found dead or 
were sacrificed when found to be moribund. The age of the mice was calculated in days. 
Because no mice died in the control group, confidence limits could not be determined. 

30 
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Microsatellite Instability in Lymphoid Tumors 

[0235] Paired ear-notch (i.e. normal) and lymphoid tumor tissues were analyzed for 
microsatellite instability at five chromosomal loci: D17Mitl23, D10Mit2, D6Mit59, D4Mit27, 
and D3Mit203. Microsatellite primer sequence pairs appropriate for amplification of these loci 
5 were obtained from the World Wide Web site of the Whitehead Institute for Genome Research 
(httpr/Zwww.genome.wi.mit.edu), and were chosen to amplify fragments containing at least 
twenty dinucleotide repeat sequences. PGR amplifications were carried out in a total reaction 
volume of 25 microliters, using 50 nanograms of DNA as template, 100 millimolar of each 
primer pair and 1 unit of Taq polymerase (Fisher Scientific, Malvern, PA). The amplification 
10 reaction involved 35 cycles of amplification (94^G, 15 seconds; 56'^G, 30 seconds; 72'^G, 1 
Q minute). Amplified products were resolved by electrophoresis on a 6.7% (w/v) denaturing 

^ poly aery lamide gel and were visualized by silver nitrate staining of the gel. 

•^^ [0236] The results of the experiments presented in this Example are now described. 

^^13 [0237] Twenty-one Msh2"^"p53"^' mice were generated from Msh2"^^"p53'^^", Msh2"^" 

15 p53'^^", or Msh2~^^"p53"^" parents. When the gender of each of the twenty-one Msh2"^"p53"^' 

J,^ mice was examined, all were determined to be male Msh2'%53"^~ mice. The absence of 

I I 

female Msh2"'"p53"'" offspring is highly significant (p < 0.001) and is unlikely to reflect the 

\ y 

lU intrinsic bias for males observed in the colony from which the mice were derived, wherein the 

normal male:female ratio is 181:138„ 

20 [0238] The fertility of male Msh2"%53"^' mice could not be determined, because they 
succumbed to tumors before they successfully mated. However, pathological examination of 
the testes of the male Msh2"^'p53"^" mice did not reveal gross abnormalities upon autopsy, and 
histology revealed mature spermatogenesis in all twenty-one of the male Msh2"^"p53"^" mice. 
Taken together, these resuhs suggest that Msh2"^"p53"^" male mice are not sterile. 

25 [0239] No gross morphological abnormalities were observed in Msh2"^" animals either in 
utero or post-natally (de Wind et al., 1995, Cell 82:321-330; Reitmair et al., 1995, Nature 
Genet. 1 1 :64-70). In addition, the number of male and female Msh2"^" mice in the studies 
described herein was in accord with the expected 1:1 ratio, which suggests that male and female 
nullizygous Msh2 mice are equally viable. However, a decrease in the number of live bom 

30 nuUizygous p53 mice from the expected Mendelian ratio was observed, which is qualitatively 
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similar to previous reports, although our limited numbers did not indicate a sex bias (Sah et al., 
1995, Nature Genet. 10:175-180; Nicols etal., 1995, Nature Genet. 10:181-187). 
[0240] No female Msh2"/"p53"^" mice were observed at weaning and none of thirteen one- 
day-old pups which were found dead in the Utters of mating pairs were Msh2"^"p53"^". Thus, all 
female embryos nullizygous for both Msh2 and p53 died in utero. To determine the point in 
embryonic development at which these embryos died, numerous timed pregnancies were 
established. Because Msh2"'^"p53"^" males were not available and Msh2-/-p53'^" females were 
not viable, pairs of mice, each of which mice was a known Msh2^^"p53^^", Msh2~^^'p53"^', or 
Msh2"^"p53+^" mouse, were mated to produce Msh2"^'p53"^" embryos. Pregnant females were 
sacrificed at 9.5, 11. 5, and 13.5 days of gestation, the embryos were pathologically assessed for 
developmental defects and the genotype and gender of each embryo were determined by PGR. 
The results of these analyses are presented in Table 1 . A total of twenty-one embryos and six 
resorption sites were recovered from three females at day 13.5 of gestation. Of the twenty-one 
13.5 day embryos, two male Msh2"^'p53"^" embryos and no female Msh2"^"p53"^" embryos 
were recovered, although a total of five Msh2"^"p53 '^"embryos were statistically expected. Two 
13.5 day embryos (one male Msh2"^^"p53"^'; one female Msh2"^"p53'^^') displayed exencephaly, 
while all other 13.5 day embryos appeared normal (Sah et al, 1995, Nature Genet 10:175-180). 



Table 1 



Sex and Morphological Phenotype of Timed Post-Implantation Embryos 



Days 


Resorption 


#of 


Embryos 


Female 


Male 


Development 


Sites 


Embryos 


Typed 


Msh2"^"p53"^" 


Msh2"^"p53'^" 










Nor Abnr 


Nor Abnr 


e9.5 


3 


30 


28 


3 1 


2 1 


el 1.5 


11 


21 


17 


0 4 


2 0 


el3.5 


6 


21 


21 


0 0 


2 0 


*28 




*96 


*96 


*0 *0 


*21 *0 



* Refers to live-bom animals at twenty-eight days following birth. 
[0241] In Table 1, embryos that arrested in development, that were in resorption, or that 
displayed gross abnormalities were classified as abnormal (Abnr), while those embryos which 



-77- 



were not arrested in development, were not in resorption, and did not display gross 
abnormalities were classified as normal (Nor), Thirteen newborn pups that were found dead, 
none of which were Msh2'^"p53"'^', are not represented in this Table. 

[0242] Twenty-one embryos and eleven resorption sites were recovered from three pregnant 
5 females at day 1 1 .5 of gestation. Of these, complete PGR typing results were determined for 
seventeen embryos and one resorption site. Five embryos were determined to be Msh2"^"p53"^", 
although eight Msh2"^"p53'^' embryos were statistically expected. Two of the five embryos 
were males that appeared morphologically normal (one is depicted in Fig. 9A), and three of the 
five embryos were females, all three of which had undergone developmental arrest, and all three 
10 of which are depicted in Figures 9B, 9C, and 9D. The three female Msh2"^"p53"^" embryos 
appeared opaque and somites were not visible. Based on the gross morphology of the three 
female Msh2"^"p53''^' embryos, it was estimated that they died at 9.5 days of development. The 
tissue from the resorption site was typed as female Msh2'^"p53"^". 

[0243] Thirty embryos and three resorption sites were recovered from pregnant females at 

15 day 9.5 of gestation. Twenty-eight embryos and one resorption site were successfully typed. 
Two embryos and a resorption site were found to be male Msh2"^"p53"^", and four embryos 
were typed as female Msh2"^"p53"^". Six Msh2"^"p53"^" embryos were statistically expected. 
Neither of the male Msh2"^"p53"^" embryos exhibited any gross morphological abnormality. It 
is likely that the male Msh2"^'p53"^' resorption site represents a spontaneous abortion event. In 

20 one of the four female Msh2"^"p53'^" embryos, the anterior neural tube was not closed and the 
heart was not seen to beat, which should occur around day 9 of development. These 
observations are consistent with a developmental delay that could result from late fertilization 
or implantation or alternatively, from a developmental abnormality that is apparent at day 9.5. 
[0244] Paraffin embedded tissue sections from wildtype and Msh2"^"p53"^" female 

25 embryos, as depicted in Fig. 10, from Msh2"^" embryos, and from p53'^" embryos were 

examined at day 1 1 .5 and at day 13.5. While the wildtype, Msh2"^", and p53"^" embryos had 
clearly distinguished developmental features at day 1 1.5, the arrested Msh2"^'p53'^" female 
embryos contained noncohesive cells without preservation of embryonal tissue structures. In 
addition, H&E stained Msh2"^'p53"^" female embryonic tissue sections appeared to contain an 

30 large number of "blebbed" structures typical of apoptotic cells. Furthermore, loss of nuclear 
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hematoxylin stain typical for necrosis was not observed in H&E stained Msh2 p53 female 
embryonic tissue sections (Fig. 10, Panel B). 

[0245] TUNEL staining was performed on the paraffin embedded tissue sections (Fig. 10, 
Panels C-F). Although wildtype (Fig. 10, Panels C and E), Msh2"^", and p53"^' embryos 
5 displayed circumscribed foci of apoptotic cells characteristic of normal embryonal 

development, Msh2"^"p53"^" female embryos displayed global catastrophic apoptosis (Fig. 10, 
Panels D and F). Furthermore, fluorescence TUNEL staining of Msh2"^"p53"^' female embryos 
revealed a speckled intracellular patterning characteristic of fragmented chromatin (Fig. 10, 
Panel F). It was estimated that between about 60% and about 90% of cells in Msh2"^"p53"^" 
10 female embryos were undergoing visible apoptosis, as assessed by H&E and TUNEL staining. 
O [0246] Kaplan-Meier survival analysis was performed on a cohort of ninety-six mice, the 

:,p data for which analysis are graphically presented in Fig. 1 1 . Msh2 p53" mice began to die of 

generalized lymphomas at day 53 after birth and all twenty-one mice were dead 

■jb within four months of birth. In contrast, only 18% (eight of forty-four) of Msh2"^" littermates 

h§ 15 and 71% (five of seven) of p53 littermates were dead at the time the mice were analyzed. 
J^t Thus, Msh2"^"p53'^' mice had a significantly (p<0,001) reduced median survival time of 73 

days compared with the median survival time of either Msh2"^" mice (i.e. 200 days) or p53"'^" 
ilJ mice (i.e. 149 days). Furthermore, all twenty-four wild-type (i.e. Msh2'^'^'^p53"*"^'^) littermates 

were alive after approximately ten months. These results indicate that Msh2 and p53 null 
20 mutations cooperatively promote tumorigenesis. p53 has also been shown to cooperate with a 
variety of other genes in mouse tumorigenesis models (Blyth et al., 1995, Oncogene 10:1717- 
1723; Williams et al, 1994, Cold Spring Harbor Symp. Quant. Biol. 59:449-457; Williams et 
al., 1994, Cell 79:329-339; Donehower et al, 1995, Genes Dev. 9:882-895; Nacht et al., 1996, 
Genes Dev. 10:2055-2066). However, as is apparent from Fig. 1 1, the effect on tumor-related 
25 death of having dual null mutations of Msh2 and p53 is greater than the sum of the effects of 
having a single null mutation in Msh2 or p53 alone. Thus, the Msh2'^"p53"^" mouse described 
herein has a phenotype which is significantly different from a mere combination of the 
phenotype of a Msh2"^" mouse and the phenotype of a p53"^" mouse. 

[0247] Pathological examination of tumors showed that all twenty-one Msh2"^"p53'^' mice 
30 developed highly aggressive generalized lymphomas involving major organs. In addition, a 
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pleomorphic sarcoma in the flank, a malignant fibrous histiocytoma of the neck, and a tubular 
adenoma of the small intestine were observed, while other epithelial neoplasms were not 
detected. The tumor spectrum of Msh2"^" and p53"^' mice appeared similar to previous 
observations (de Wind et al, 1995, Cell 82:321-330; Reitmair et al, 1995, Nature Genet. 1 1 :64- 
70; Donehower et al, 1992, Nature 356:215-221; Jacks et ah, 1994, Curr. Biol. 4:1-7; Purdie et 
al, 1994, Oncogene 9:603-609). The tumor spectrum of Msh2'^"p53"^" mice differs 
significantly from the tumor spectrum of either Msh2"^" mice or p53'^' mice. Thus, Msh2"^" 
p53"^' mice have utility different from that of either Msh2"^" mice or p53'^' mice. 
[0248] Normal and tumor tissues obtained from individual Msh2"^'p53'^" mice were 
examined for microsatellite instability at five loci: D17Mitl23, D10Mit2, D6Mit59, D4Mit27, 
and D3Mit203. The results of these MSI studies are presented in Table 2. The frequency of 
MSI in tumor tissues obtained fi-om Msh2"^" mice was not significantly different (p>0.05) from 
the frequency of MSI in tumor tissues obtained from Msh2"^"p53"^" mice. Microsatellite 
instability was not observed in lymphomatous tumor tissue obtained from the seven p53"^' mice 
examined. The observation that Msh2"^"p53"^" mice developed earlier onset of tumor-related 
disease, combined with the observed separate segregation of the MSI phenotype with the Msh2 
allele, suggests that Msh2 and p53 are not genetically epistatic. 
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Table 2 

The Frequency of Microsatellite Instability in p53"^", Msh2"^", and 

Msh2"^"p53"^" Mice 



Genotype 


Tumors 


MSI at 


MSI at >2 


MSI at >3 


Tumor / Normal Pairs 


Examined 


>1 Locus 


Loci 


Loci 




(n) 








p53"^" 


7 


0 (0%) 


0 (0%) 


0 (0%) 


Msh2"^" 


8 


6 (75%) 


4 (50%) 


3 (38%) 


*Msh2"^"p53"''" 


21 


17(81%) 


14 (67%) 


12 (57%) 



[0249] 



^Because female Msh2"^"p53"^" mice died during embryonic development, this refers to 
only male Msh2"^"p53"^' mice. 

It is remarkable that female Msh2"^"p53"^" mouse embryos underwent global 



developmental arrest and that widespread apoptosis of the cells of such embryos occurred 
around day 9.5 of development. That these embryos underwent implantation and gastrulation 
strongly suggests that they are capable of executing the earlier stages of embryogenesis. The 
arrested phenotype is reminiscent of that described for a small proportion of female p53'^' mice 
(Sah et ah, 1995, Nature Genet. 10:175-180). However, unlike p53"^" mice, no normal female 
Msh2"^"p53"^" mice or embryos were observed beyond 9.5 days of embryonic development. 
This observation supports the conclusion that the female embryonic lethality of Msh2"^"p53"^" 
mice is highly penetrant. In addition, none of the female Msh2"^"p53"^" embryos displayed the 
15 exencephaly that characterized the p53"^" mice (Sah et al, 1995, Nature Genet. 10:175-180). 
Furthermore, while there was no difference in apoptosis observed in developing p53"^" mouse 
embryos, global catastrophic apoptosis was clearly observed in all the Msh2"^'p53"^" female 
mouse embryos examined at day 9,5 of development. These results suggest that female Msh2" 
^"p53"'^" mice succumb at an earlier stage and by an entirely different pathology than p53'^" 
20 mice. 

[0250] Without being bound to any particular theory, the lethality observed in female 
Msh2'^"p53"^" mouse embryos is consistent with the following explanation. In the female 
embryonic lineage, dosage compensation is achieved by random X chromosome inactivation 
around the time of gastrulation, at which time intense embryonic cellular proliferation and 
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apoptosis promote embryonic differentiation (Lyon, 1961, Nature 190:372-373; Rastan, 1994, 
Curr. Opin. Genet. Dev. 4:292-297; Theiler, 1972, In: The House Mouse Development and 
Normal Stages from Fertilization to 4 Weeks of Age, Springer- Verlag, Nev^ York, p. 168). The 
global apoptotic effect need not occur coincidentally with X chromosome inactivation. The fall 
effect of dysregulation may only become apparent after a number of cell divisions when the 
embryo undergoes a fiirther burst of proliferation during embryonic 'turning' between 8 and 9.5 
days. 

[0251] It has been shown that the inactivated X chromosome replicates late in S phase 
(Taylor, 1960, J. Biophys. Biochem. Cytol. 7:455-464; Tagaki, 1974, Exp. Cell. Res. 86:127- 
135). In addition, cells deficient in p53 have been shown to be defective for damage-induced 
G|/S checkpoint arrest, and cells that are deficient in MMR have been shown to be deficient for 
damage-induced G2/M checkpoint arrest (Baker et aL, 1990, Science 249:912-915; Diller et aL, 
1990, MoL Cell, Biol. 10:5772-5781; Lin et aL, 1992, Proc. Natl. Acad. Sci. USA 89:9210- 
9214; Hawn et aL, 1995, Cancer Res. 55:3721-3725; Marra et aL, 1996, Oncogene 13:2189- 
2196). Thus, female-specific Msh2"^'p53"^' embryo lethality may result from dysregulation of 
damage-induced arrest checkpoint control, wherein such dysregulation is caused by a deficiency 
of both p53 and Msh2, and whereby such dysregulation results in an inability of Msh2"^"p53'^" 
cells to arrest cell division and repair damage introduced into the late replicating inactive X 
chromosome. Such damage could take the form of non-replicated regions or chromosomal 
fragments that have resulted from inappropriate cell division prior to the completion of inactive 
X chromosome replication. Fragmented, reactivated, or otherwise altered inactive X 
chromosomes may then lead to global catastrophic cellular failure, developmental arrest, and 
apoptosis. Furthermore, the observation that the highest levels of p53 mRNA are detected in 
wild-type embryos between 9 and 1 1 days of development suggests an important role for p53 
protein within this time frame (Rogel et aL, 1985, Mol. Cell. Biol. 5:2851-2855), 
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Example 3 

A Discussion of hMSH2:hMSH6 Heterodimers in the Context of 
Mismatch Repair, Molecular Switches, and Signal Transduction 
[0252] The foundation of molecular switches in biology is grounded in translation 
5 elongation and cellular signal transduction. In these systems, guanine nucleotide-boimd 

proteins (G-proteins) produce the ON and OFF signaling states that act as gates for downstream 
biochemical processes. Experimental results described herein, in view of the results of studies 
by others, suggest that a similar molecular switch relies on adenine nucleotide-bound forms 
(A-proteins) to produce an ON and OFF signaling state related to mismatched DNA repair and 
10 possibly to other processes. In the field of signal transduction, the concept of a molecular 

switch is elementary, while the biochemical processes of DNA repair appear foreign. Similarly, 

^ the field of DNA repair recognizes the complex machinery required for DNA manipulation 

events, but regards biochemical signaling processes as essential cellular input which is outside 

© the genome juggernaut. 

T 15 

Genetics of Mismatch Repair 

[0253] There are at least three ways in which mismatched nucleotides arise in DNA. 
51 Physical or chemical damage to the DNA and its precursors, such as de-amination of 

5-methyl-cytosine, can give rise to mismatched bases (Friedberg, 1990, DNA Repair W.H. 

20 Freeman Co., New York). Misincorporation of nucleotides during DNA replication can yield 
mismatched base pairs as well as the insertion and deletion of nucleotides (for review see: 
Kolodner, 1996, Genes Dev. 10:1433-1442; Modrich, 1989, J, Biol. Chem. 264:6597-6600; 
Modrich, 1997, J. Biol. Chem. 272:24727-24730). Genetic recombination produces regions of 
heteroduplex DNA which may contain mismatched nucleotides when such heteroduplexes 

25 result from the pairing of two different parental DNA sequences (HoUiday, 1964, Genet. Res. 
5:282-304). Mismatched nucleotides produced by each of these mechanisms are known to be 
repaired by enzyme systems that are both specific and overlapping (Friedberg, 1990, DNA 
Repair, W.H. Freeman Co., New York). 

[0254] The most extensively studied system for mismatch repair (MMR) is the DNA 
30 adenine methylation (Dam)-instructed pathway of Escherichia coli (Modrich, 1 989, J. Biol. 
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Chem. 264:6597-6600; Modrich and Lahue, 1996, Annu. Rev. Biochem. 65:101-133). The 
Dam-Instructed pathway promotes a long-patch (approximately 2 kilobase pair) excision repair 
reaction which is genetically dependent on the mutH, mutL, mutS, and mutU (uvrD) gene 
products. Discrimination of the newly replicated DNA strand from the original template DNA 
5 strand is dependent on transient under-methylation of the adenine nucleotide within GATC 
Dam sequences. The MutHLS pathway appears to be the most active MMR pathway in E. coli 
and is known to both increase the fidelity of DNA replication as well as to act on recombination 
intermediates containing mis-paired bases (Fishel et al., 1983, UCLA Symp. MoL Cell. Biol. 
New Series 11:309-324; Fishel etaL, 1986, J. MoL Biol. 188:147-157). 
10 [0255] Homologs of prokaryotic MutS and MutL proteins have been identified in nearly 

s|3 every organism with the exception of Archaea (Fishel et al., 1997, Curr. Opin. Genet. Dev. 

IJt 7:105-113; Kolodner, 1996, Genes Dev. 10:1433-1442). At present, there are 41 MutS 

homologs and 21 MutL homologs in the NCBI database. In S. cerevisiae, six MutS homologs 

0 (MSHl - MSH6) and three MutL homologs (MLHl, MLH2, PMSl) have been identified. In 

15 human cells, a nearly identical set of five MutS homologs (hMSH2 - hMSH6) and three MutL 

;ig homologs (hMLHl , hPMS 1 , and hPMS2) are known (Acharya et al., 1 996, Proc. Natl. Acad. 

jU Sci. USA 93:13629-13634; Bronner et al., 1994, Nature 368:258-261; Burns et al., 1994, Genes 

0 Dev. 8:1087-1105; Fishel etal., 1993, Cell 75:1027-1038; Fujii etal., 1989, J. Biol. Chem. 

264:10057-10064; HoUingsworth et al., 1995, Genes Dev. 9:1728-1739; Kramer et al., 1989, J. 
20 Bacteriol. 171:5339-5346; Linton et al., 1989, Mol. Cell. Biol. 9:3058-3072; Mankovich et al., 
1989, J. Bacteriol. 171 :5325-5331 ; New et al., 1993, Mol. Gen. Genet. 239:97-1 08; Nicolaides 
et al., 1994, Nature 371:75-80; Palombo et al, 1995, Science 268:19121-19914; ProUa et al., 
1994, Mol. Cell. Biol. 14:407-415; Reenan et al., 1992, Genetics 132:963-973). Yet, with the 
exception of gram-negative bacteria, there do not appear to be homologs of MutH. Thus, the 
25 mechanism of strand discrimination in even close relatives of E. coli, the gram-positive 
bacteria, remakis a mystery. The multiple MutS and MutL homologs have been found to 
participate in the diverse activities of nuclear (MSH2, MSH3, MSH6, MLHl, PMSl) and 
organellar (MSHl) post-replication mismatch repair as well as having distinct meiotic functions 
(MSH4, MSH5) (Fishel et al, 1997, Curr. Opin. Genet. Dev. 7:105-1 13; Kolodner, 1996, 
30 Genes Dev. 10:1433-1442). 
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Biochemistry of Mismatch Repair 

[0256] Purification and reconstitution studies by Modrich and colleagues have led to a 
biochemical model for post-replication mismatch repair in E. coli. The reconstituted system 
5 requires the MutH, MutL, MutS and UvrD (helicase II) proteins along with DNA polymerase 
III holoenzyme, DNA ligase, single-stranded DNA binding protein (SSB) and one of the 
single-stranded DNA exonucleases, Exol, ExoVII or RecJ (Cooper et al., 1993, J. Biol. Chem. 
268:11823-11829; GriUey et al., 1989, J. Biol. Chem. 264:1000-1004; Lahue et al., 1989, 
Science 245:160-164; Lu et al., 1983, Proc. Natl. Acad. Sci. USA 80:4639-4643; Su et al., 
10 1986, Proc. Natl. Acad. Sci. USA 83:5057-5061 ; Welsh et al., 1987, J. Biol. Chem. 
O 262 : 1 5624- 1 5629). In this widely held biochemical model, initiation of a MMR event occurs 

when MutS recognizes and binds mis-paired nucleotides that result from polymerase 
5 misincorporation errors (Su et al., 1986, Proc. Natl. Acad. Sci. USA 83:5057-5061). It is 

!K suggested that MvitS mismatch binding is followed by interaction with the MutL protem 

•0 15 (GriUey et al., 1989, J. Biol. Chem. 264: 1 000- 1 004), which has been proposed to accelerate an 
13 ATP-dependent translocation of the MutS-MutL complex (Allen et al., 1997, EMBO J. 16: 

* 4467-4476) to a hemi-methylated GATC Dam site bound by MutH (Welsh et al., 1987, J. Biol. 

5^ Chem. 262: 1 5624-1 5629). The MutS-MutL complex then stimulates an intrinsic endonuclease 

f * activity of MutH which resuhs in a specific strand scission on the non-methylated newly 

20 replicated DNA strand (Cooper et al, 1993, J.Biol. Chem. 268:11823-11829; Lahue etal., 

1989, Science 245:160-164; Welsh et al., 1987, J. Biol. Chem. 262:15624-15629). This strand 
scission directs one of three single-stranded exonucleases (RecJ, Exo I, ExoVII) to degrade the 
newly replicated strand, which is then re-synthesized by the PolIII holoenzyme complex (Lahue 
et al., 1989, Science 245:160-164). The net result is a strand-specific mismatch repair event 
25 which can be bidirectional. Many of the genetic studies performed with this system appear to 
support this biochemical interpretation. For example, mutH, mutL, and mutS bacteria exhibit a 
mutator phenotype that is presumed to be the result of the increased probability of 
misincorporation errors leading to mutations (Demerec et al., 1957, Carnegie Inst. Wash. 
Yearbook 370:390-406; Hill, 1970, Mutat. Res. 9:341-344; Miyake, 1960, Genetics 
30 45:755-762; Siegel et al, 1967, J. Bacteriol. 94:38-47). However, not all predictions arising 
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from this model agree with the genetic results. For example, recJ exol exoVII bacteria do not 
appear to exhibit a mutator phenotype (Harris et al., 1998, J. Bacteriol. 180:989-993), 
suggesting that there may be other exonuclease(s) or mechanism(s) involved in the mismatch 
repair process. 

5 

Functions for the Mismatch Repair Proteins 

[0257] An activity exhibited by mismatch repair proteins is the specific mis-pair binding 
activity ascribed to MutS homologues (Acharya et al., 1996, Proc. Natl. Acad. Sci. USA 
93:13629-13634; Chi et al., 1994, J. Biol. Chem., 269:29984-29992; Drummond et al., 1995, 
10 Science 268:1909-1912; Fishel et al., 1994, Science 266:1403-1405; Gradia et al, 1997, Cell 
O 91:995-1005; Marsischky et al., 1996, Genes Dev. 10:407-420; Su et al, 1986, Proc. Natl. 

Acad. Sci. USA 83:5057-5061). A clear flmction of the MutL homologs has, until the present 
5 invention, not been clear. Classification of MutS and MutL homologs is based on the 

recognition of highly conserved regions of amino acid identity. The most highly conserved 
■C 15 region of the MutS homologs is confined to a region of approximately 150 amino acids that 
C5 encompass a helix-turn-helix domain associated with a Walker-A adenine-nucleotide and 

J magnesium binding motif Such motifs were described by Walker et al (1 982, EMBO J. 

W 1 :945-95 1). This adenine nucleotide binding domain constitutes 100% of the identity between 

|i the known MutS homologs (Fishel et al, 1997, Curr. Opin, Genet. Dev. 7:105-1 13). Purified 

20 bacterial, yeast, and human MutS homologs exhibit an intrinsic low-level ATP hydrolytic 

(ATPase) activity (Alani et al, 1997, Mol. Cell. Biol. 17: 2436-2447; Chi et al, 1994, J. Biol. 
Chem., 269:29984-29992; Gradia et al, 1997, Cell 91:995-1005; Haber et al, 1991, EMBO J 
10:2707-2715). This ATPase activity is likely to be important for the function of the MutS 
homologs, as evidenced by the observation that mutation of a conserved lysine residue in the 
25 adenine nucleotide binding domain results in a dominant mutator phenotype in both bacteria 
and yeast (Alani et al, 1997, Mol. Cell. Biol. 17: 2436-2447; Haber et al, 1991, EMBO J. 
10:2707-2715). 

[0258] The most widely held model for MMR suggests MutS mis-pair binding is followed 
by MutL association that results in an energy dependent translocation of this complex to a 
30 hemi-methylated Dam site occupied by the MutH protein. In retrospect, this appears to have 
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been a simplistic view since the rate of ATP hydrolysis (k^^t^ ^ unlikely to be 

sufficient to drive mechanical translocation the, on average, several hundred to thousand 
nucleotides required to encounter a MutH bound hemimethylated site. For example, if one ATP 
was required to translocate one nucleotide, as the most well accepted mechanism suggests, then 
5 it would take 25-1 00 minutes to encounter a MutH on average. Yet, re-methylation of the 
transiently hemimethylated Dam sites has been found to occur within 0.1 to 3 minutes of 
passage of the replication fork (Campbell et al, 1990, Cell 62:967-979), While the ATPase 
activity could in theory be significantly faster in vivo, no stimulatory factor has been identified 
to date in spite of an extensive search. In addition, the prevailing mechanism does not 
10 adequately account for MutL function nor the highly conserved domains recognized between 
O MutL homologs from bacteria to man (regions containing 1 00% identity in 21 homologs). 

:, 

■;| The hMSH2-hMSH6 Molecular Switch 

:^ [0259] As described herein in Example 2 and elsewhere, human MutS homolog dimers, 

vfl 15 such as the hMSH2:hMSH6 heterodimer, function as molecular switches responsible for the 
Q timing of mismatch repair, as illustrated in Figure 7. This conclusion is based on the 

fll observations that: 

1) The ADP-bound heterodimer has high affinity for mismatched nucleotides; 
U 2) exchange of ADP for ATP results m release of the heterodimer from mismatched 

20 duplex DNA in the absence of hydrolysis; 

3) release of the heterodimer from mismatched duplex DNA occurs by 
hydrolysis-independent diffusion off the ends of the short oligonucleotides used in the 
experiments described in Example 2, as confirmed by the experiments described in Example 4 
herein; and 

25 4) hydrolysis of ATP results in recovery of the mismatch-binding competent 

ADP-bound heterodimer. 

[0260] The rate-limiting step and the ultimate control of the hMSH2:hMSH6 molecular 
switch is likely to be ADP to ATP exchange, which is exceedingly inefficient in the absence of 
mismatched duplex DNA. The characteristics of the hMSH2:hMSH6 heterodimer appear 
30 analogous to the characteristics of G-protein mediators of seven-transmembrane (7-TM) 
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domain receptor signaling such as that used by the beta- Adrenergic and Rhodopsin Receptors 
and the prototypical oncoprotein/G-protein Ras (Tocque et al, 1997 Cell Signal. 9:153-158), 
More specifically, the observation that the hMSH2:hMSH6 heterodimer is induced to exchange 
ADP for ATP in the presence of mismatched duplex DNA and then dissociates from the 
5 mismatched portion of the duplex DNA to transduce a signal, is analogous to the observation 
that ligand binding by 7-TM receptors induces associated G-proteins to exchange GDP to GTP 
and dissociate from the receptor to transduce a signal. 

[0261] These similarities suggest two related models for mismatch repair that are 
fundamentally different from all previously suggested models. These models axe each based on 
10 the concept that MutS and its homologs are a novel type of molecular switch which determines 
O the timing and/or appropriate assembly of repair components. The apparent affinity of the 
M hMSH2:hMSH6 heterodimer for mismatched duplex DNA (Kd = about 2-20 nanomolar) 

% suggests that a single mismatch in a human cell should be efficiently recognized and bound. 

JS Furthermore, binding of the hMSH2:hMSH6 heterodimer to mismatched duplex DNA is 

''0 15 slightly stabilized in the presence of ADP. We would propose two non-exclusive models. 
13 [0262] In the first model, tight binding of the ADP-bound form of the hMSH2:hMSH6 

j| heterodimer to mismatched duplex DNA acts as a flag for the assembly or nearby localization 

Ij^ of DNA excision repair components. When the complete excision repair complex is assembled, 

exchange of ADP for ATP is triggered and the hMSH2:hMSH6 heterodimer is released from 
20 the mismatched portion of the duplex DNA, thus signaling exonucleolytic excision and 
resynthesis of the region containing the mismatched nucleotide. Once released from the 
mismatched portion of the duplex DNA, the intrinsic ATPase activity of hMSH2-hMSH6 
hydrolyzes bound ATP, resulting in a form that is once again competent for mis-pair binding. 
[0263] In the second model, recognition of mismatched duplex DNA by the ADP-bound 
25 form of the hMSH2:hMSH6 heterodimer provokes ADP to ATP nucleotide exchange. ATP- 
hydroly sis-independent DNA-associated difftision of the hMSH2:hMSH6 heterodimer away 
from the mismatch portion of the duplex DNA to the assembled (or partially assembled) DNA 
mismatch repair complex. Activation of these components by the confederation of the 
ATP-bound form of the hMSH2:hMSH6 heterodimer either engages the repair process 
30 (signaling the timing of mismatch repair as above) or triggers assembly of the remaining DNA 
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mismatch repair components. This activation event results in release of the hMSH2:hMSH6 
heterodimer from the duplex DNA, hydrolysis of ATP bound to the hMSH2:hMSH6 
heterodimer, and recycUng of the form of the hMSH2:hMSH6 heterodimer capable of 
associating with mismatched duplex DNA. An advantage of this second model is that the 

5 hMSH2:hMSH6 heterodimer remains associated with the DNA in an activated-form, poised to 
transduce the mismatch signal to any nearby mismatch repair components. 
[0264] As a free protein complex, the hMSH2:hMSH6 heterodimer does not efficiently 
exchange ADP remaining after hydrolysis of ATP bound thereto, providing a long-term 
mismatch recognition-competent molecule. A key difference in the mismatch repair models 

10 described above and those previously proposed, is the concept that ATP hydrolysis is not 
required to physically transduce the mismatch binding signal to downstream DNA mismatch 
repair components. Instead, ATP hydrolysis is required only to recycle the mis-pair recognition 
component (i.e. the hMSH2:hMSH6 heterodimer). Without wishing to be bound by any 
particular theory, it is thought that the signal state of the hMSH2:hMSH6 heterodimer is related 

15 to the conformational state of the heterodimer, which in turn is related to whether ADP or ATP 
is bound thereto. 

[0265] One of the most important observations concerning G-proteins is their regulation by 
associated proteins (Bokoch et al., 1993, FASEB J. 7: 750-759). There are two halves to the 
GTPase cycle: gamma-phosphate hydrolysis and GDP to GTP nucleotide exchange. Both of 

20 these steps can be regulated either by inhibition or acceleration of these partial reactions. For 
example, the Ras protein has an remarkably sluggish intrinsic GTPase activity (Trahey et al., 
1987, Mol. Cell. Biol. 7:541-544), which can be accelerated at least 104-to 105-fold by a 
GTPase Activating Protein (GAP) (Trahey et al., 1987, Science 238:542-545). In addition, 
there are other Regulators of G-Protein Signaling (RGS) that singularly accelerate GTP 

25 gamma-phosphate hydrolysis, and GDP to GTP exchange stimulators (GES) and guanine 

dissociation inhibitors (GDI) that singularly affect nucleotide exchange (Dohlman et al, 1997, 
J. Biol. Chem. 272:3871-3874; QuiUiam et al., 1995, Bioessays 17:395-404; Tocque et al., 
1997, Cell Signal 9:153-158). It has been discovered herein that MutL homologs perform 
analogous functions (i.e. accelerate ATP gamma-phosphate hydrolysis, and ADP to ATP 

30 exchange) with respect to MutS homologs. 
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Biological Switches and the Second Law of Thermodynamics 

[0266] One could argue that the concept of a singular ON or OFF state in a molecular 
switch might violate the second law of thermodynamics. This law requires that biochemical 
systems transit one state to the other by a series of microscopically reversible steps. This idea is 
based in statistical mechanics as it is applied to a system at equilibrium - which must be applied 
a priori to enzyme catalyzed biological processes. It is easy to visualize the origins of the 
principle of microscopic reversibility by considering the consequences were it NOT true. For 
example, if the rate of A to B were greater than B to A at equilibrium, each of the rates B to C, 
C to and D to A would also have to be greater than their reverse rates in order to prevent 
build-up of the concentration of any species, which is not permitted at equilibrium. In this case 
there would be a preferred direction-of-operation of the reaction cycle. Such a spontaneous 
cycle in a system at equilibrium (i.e. an engine that spontaneously produces work) is not 
consistent with the drive toward maximum entropy contained in the second law of 
thermodynamics. 

[0267] There is no violation of the second law of thermodynamics if the transit from an 
OFF to ON state (or visa versa) occurs reversibly. The molecular basis for this type of 
microscopic reversibility can be visualized for the MutS dimer and G-protein switches as 
reversible nucleotide-binding as well as intermediate protein conformational changes that occur 
while transiting the extreme states. It is these conformational transitions that determine 
interaction with effectors which is uhimately accounted for by the hydrolysis of NTP. More 
significantly, one can experimentally affect the equilibrium of each state by altering the ratio of 
NDP/NTP in the absence of any hydrolysis, as indicated in Figure 4B. It is also important to 
note that microscopic reversibility has been directly demonstrated for the "gated" maxi K"^ ion 
pump, which is a molecular switch controlled by similar conformational transitions (Song et al., 
1994, Biophys. J. 67:91-104). Thus, molecular switches are both reversible and, at equilibrium, 
clearly preserving a fundamental tenant of thermodynamics. 
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Similarities Between Signal Transduction and DNA Metabolism 

[0268] The use of controlled molecular switches appears to pervade all aspects of biology. 
From the standpoint of DNA metabolism, switch controlled processes appear mechanistically 
sensible. It is well known that the cellular components which perform replication, 
5 recombination, repair, and chromosome segregation are very large and composed of multiple 
subunits (Alberts, 1998, Cell 92:291-294). Analogous to an assembly-line for an automobile or 
an airplane, the assembly of DNA metabolic machines must be done precisely and in a specific 
order to ensure appropriate function. A series of well defined switches could logically control 
the progression of such an ordered assembly process. 
10 [0269] The same type of switch-controlled cascade events that transduce cellular signals 

may also control DNA metabolic events. An important difference between these switches is the 
identity of the nucleotide that induces the conformational transitions associated with signaling. 
At the moment the general rule seems to be that guanine nucleotides are involved in cellular 
signaling events and adenine nucleotides are involved in DNA metabolic signaling events. 



Example 4 

Interactions of hMSH2 with hMSH3 and of hMSH2 with hMSH6: 
Examination of Mutations Associated with HNPCC 

[0270] In the experiments described in this Example, mutations in the human mismatch 
20 repair protein hMSH2 were determined to co-segregate with the occurrence in individuals 
afflicted with hereditary non-polyposis colorectal cancer (HNPCC). As described herein, 
hMSH2 forms specific mis-pair binding complexes with hMSH3 and hMSH6. These protein 
interactions were further characterized by mapping the contact regions between the monomers 
of the hMSH2:hMSH3 and hMSH2:hMSH6 heterodimers. 
25 [0271] The resuhs described in this Example demonstrate that there are at least two distinct 
regions of monomer:monomer interaction in both hMSH2:hMSH3 and hMSH2:hMSH6 
heterodimers. The same regions of the hMSH2 monomer interact with regions of both the 
hMSH3 monomer and the hMSH6 monomer. Furthermore, there is a coordinated linear 
orientation of these regions, by which is meant that the amino-terminal region of hMSH2 
30 associates with the amino-terminal of either hMSH3 or hMSH6 and the carboxy-terminal 
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region of hMSH2 associates with the carboxy-terminal region of either hMSH3 or hMSH6. 
Several missense alterations of hMSH2 obtained from HNPCC kindreds were examined and 
were determined to occur within the consensus monomer:monomer interaction regions. None 
of these missense mutations prevented monomer :monomer interaction. These data support the 
idea that an altered interaction of hMSH2 with hMSH3 or an altered interaction of hMSH2 with 
hMSH6 is unlikely to be causative of HNPCC. 

[0272] In the experiments described in this Example the regions of monomer:monomer 
interaction were ascertained for hMSH2:hMSH3 and hMSH2:hMSH6 heterodimers. Two 
distinct interaction regions were identified for hMSH2:hMSH3 heterodimers and for 
hMSH2:hMSH6 heterodimers. The interaction regions of hMSH2 with either hMSH3 or 
hMSH6 appeared to be identical Several missense mutations of hMSH2 were constructed. 
These mutations have been reported by others to co-segregate with HNPCC. None of these 
alterations affected the interactions between hMSH2 and either hMSH3 or hMSH6 
heterodimers. 

[0273] The materials and methods used in the experiments presented in this Example are 
not described. 

Reagents and Enzymes 

[0274] Restriction endonucleases were obtained from New England Biolabs (Beverly, MA). 
PCR reactions were performed using the High Fidelity PCR Kit obtained from Boehringer 
Mannheim (Mannheim, Germany). Oligonucleotides were synthesized using an Applied 
Biosystems (Foster City, CA) model 3948 nucleic acid synthesis and purification system, DNA 
plasmid constructs were purified using Qiagen (Hilden, Germany) DNA purification kits. In 
vitro transcription and translation (IVTT) reactions were performed using the Promega 
(Madison, WI) TNT™ Coupled Rabbit Reticulocyte Lysate System. Radiolabeled ^^S 
methionine was used to label proteins and was obtained from Dupont NEN (Wilmington, DE). 
Glutathione linked (GST) agarose beads were purchased from Sigma (St. Louis, MO), 
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Subcloning of hMSH2 and hMSH3 

[0275] The cloning of hMSH2, hMSH3, and hMSH6 cDNAs and subcloning into pET 
expression vectors (obtained from Novagen) has been previously described (Acharya et al, 
1996, Proc. Natl. Acad. Sci. USA 93:13629-13634). In this study, we used a HeLa cDNA 
5 clone of hMSH3 (Gen Bank Accession U6 1 98 1 ). 

[0276] GST fusion proteins were synthesized using the pGEX system (Pharmacia, Sweden). 
For ease of cloning, plasmid pGEX-4T-2 was modified as follows. The vector DNA was 
digested using EcoRI and BamHI restriction endonucleases and purified by gel electrophoresis. 
A double-stranded linker oligonucleotide comprising a polynucleotide having the nucleotide 
10 sequence SEQ ID NO: 13 and a polynucleotide having the nucleotide sequence SEQ ID NO: 14 

0 was ligated into the vector. SEQ ID NO: 1 3 is 5'-GATCCGAGAA CCTGTACTTC 
3 CAGGGACATA TGGCCATGGG TACCG-3'. SEQ ID NO: 14 is 5'-AATTCGGTAC 

't- CCATGGCCAT ATGTCCCTGG AAGTACAGGT TCTCG-3'. The vector is herein referred 

1^ to as pGEX-SGl and permitted subcloning using Ndel and Ncol restriction endonuclease sites 

•sC 15 in which the ATG initiation codon within each site was in frame with the GST moiety. Vector 
Q pGEX-SGl also contained a TEV protease site just upstream of the Ndel and Ncol sites. 

1 ^ 

^ Construction of hMSH2 truncation mutations 

U [0277] The hMSH2 deletion mutants were constructed using known PGR truncation 

20 mutagenesis methods. 'Forward' primers were generated by adding a polynucleotide 

homologous with six codons corresponding to the desired 3'-end of Msh2, starting with a codon 
having a guanine residue in the 5'-position and adding the 17 nucleotides immediately 3*- with 
respect to that residue, to the 3'-end of a polynucleotide having the nucleotide sequence 
5'-GCGGATCCCA TGG-3' (SEQ ID NO: 15). 'Reverse' primers were generated by adding a 
25 polynucleotide homologous with the 18 nucleotides of the complementary strand corresponding 
to the six codons of desired 5'-end of Msh2 to the 3'-end of a polynucleotide having the 
nucleotide sequence 5'-GGAGGATCCC TA-3' (SEQ ID NO: 16). Using a forward and reverse 
primer, a PGR reaction was performed using pET3d-hMSH2 as template DNA. The PGR 
product and pET24d were digested with Ncol and BamHI, purified by gel electrophoresis, and 
30 ligated together. 
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[0278] To make truncated peptides containing an internal deletion, pET 24d-hMSH2 
(which did not encode amino acid residues 700-800 of hMSH2) was generated by performing 
PGR on hMSH2 using a pair of polynucleotide primers having sequences 5'-GCGGATCCCA 
TGGCAGAAGT GTCCATTGTG-3' (SEQ ID NO: 17) and 5'-GGAGGATCCC 
5 ATATGTAGAT TATTAACAGT TGG-3' (SEQ ID NO: 1 8). The amplification product and 
pET24d were digested using Ncol and BamHI, and the digested products were purified by gel 
electrophoresis and ligated together. The resulting vector permitted ligation of fragments using 
Ndel and BamHI. 'Forward' primers were designed using the first 18 nucleotides of the desired 
3'-end of msh2 ligated to the 3'-end of a polynucleotide having the sequence 
10 5'-GGCGGTATCC ATATG-3' (SEQ ID NO: 19). The reverse primer was the same as the one 
described earlier in this Example. PGR fragments were ligated into this vector using Ndel and 
BamHI. Site directed mutagenesis of hMSH2 was performed using overlap PGR, as described 
(Kallal et al, 1997, MoL Gell. Biol 17:2897-2907). All of the site directed mutations were 
J5 completely sequenced using a Perkin Elmer ABI Sequencer with XL upgrade (Perkin Elmer 

i 15 Getus, Norwalk, GT), 

Gonstruction of hMSH3 and hMSH6 truncation mutations 
W [0279] hMSH3 and hMSH6 triHication constructs were created using a method analogous to 

iji that used to generate to the hMSH2 deletion mutants. 'Forward' primers were generated using 

20 the same method described for designing hMSH2 'forward' primers for hMSH2 mutations 

having truncations. The reverse primers were generated using the same method described for 
designing hMSH2 'reverse' primers for hMSH2 mutations having either truncations or internal 
deletions, except that the polynucleotide had the sequence 5'-GGGATAGTGG AGGTA-3' (SEQ 
ID NO: 20), instead of SEQ ID NO: 16. The PGR amplification product was subcloned into 

25 either pET24d or pGEX-SGl . 

[0280] pET24d-hMSH3 (which did not encode amino acid residues 800-990 of hMSH3) 
was constructed by performing PGR using msh3 and a pair of polynucleotide primers having 
sequences 5'-GGGGATGGGA TGGATTTTGT AGAGAAATTG-3' (SEQ ID NO: 21) and 
5'-GGAGGGGTGG TGGAGGTAAG GGGTATGTGT GATGAAATAG TG-3' (SEQ ID NO: 

30 22). The amplified product and pET24d were digested using restriction endonucleases Ncol 
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and Sail and subcloned. This vector permitted ligation of inserts using restriction 
endonucleases Agel and XhoL Forward primers were generated by ligating six codons 
corresponding to the desired 3'-end of msh3 to a polynucleotide having the sequence 5'- 
GCGGTGACCG GT-3' (SEQ ID NO: 23). Reverse primers were generated as described 
5 earlier, only homologous with the non-coding strand of msh3. PGR was performed, and the 
amplified products were ligated. 

[0281] In order to avoid errors introduced by random PGR mutagenesis, all PGR 
amplification products were either completely sequenced or the experiments were conducted 
using two separately isolated PGR products. 

10 

O GST Fusion Protein Interaction assay 

J [0282] An overnight culture of E. coli XL-blue cells which harbored pGEX-hMSH(X) (i.e. 

'X' being 2, 3, or 6) was grown in LB with 50 milligrams per milliliter ampicillin. 50 milliliter 
)^ of Luria broth containing ampicillin was inoculated with 1 milliliter of the overnight culture, 

'>0 15 and the cuhure was incubated until the optical density, as assessed at 600 nanometers, was 

about 0.5. IPTG was added to a final concentration of 0. 1 millimolar, and the culture container 
^ placed in a shaker at 30°C for 2 hours to generate induced cells. Induced cells were pelleted 

^Jf and resuspended in 800 milliliters of phosphate buffered saline (Boehringer Marmheim, 

i'^- Germany) containing protease inhibitors (0.5 millimolar PMSF, 0.8 milligrams per milliliter 

20 leupeptin, 0.8 milligrams per milliliter pepstatin, and 0.1 millimolar EDTA). Lysozyme was 
added to a concentration of 1 milligram per milliliter, and the mixture was incubated on ice for 
30 minutes. Triton X-100 and dithiothreitol were added to final concentrations of 0.2% (v/v) 
and 2 millimolar, respectively. The lysate was firozen and thawed twice to completely lyse the 
cells. DNase (Boehringer Marmheim, Germany) was added to a final concentration of 20 
25 micrograms per milliliter, and the lysate was incubated on ice for an additional 20 minutes. 
Cell debris was removed by centrifuging the lysate at 14,000 rpm in a refrigerated Eppendorf 
(Model 5402) centrifuge for 30 min. The supernatant was transferred to a new microfuge tube 
which contained rehydrated GST-agarose beads in a proportion whereby approximately 10-50 
nanograms of protein were present for every 25 microliters of beads that were present. GST- 
30 fusion protein levels were quantified as described herein. The lysate was incubated with the 



-95 - 



GST-agarose beads at 4''C on a rocking platform. After rocking for 1-2 hours, the incubation 
mixture was centrifuged at 1000 rpm in an Eppendorf microfuge for 30 seconds, the supernatant 
removed, and the beads were gently resuspended in 500 milliliters of Binding Buffer. Binding 
Buffer consisted of 20 millimolar Tris, pH 7.5, 10% (v/v) glycerol, 150 millimolar NaCl, 5 
5 millimolar EDTA, 1 millimolar DTT, 0.1% (v/v) Tween 20, 0.75 milligrams per milliliter BSA, 
0.5 millimolar PMSF, 0.8 milligrams per milliliter leupeptin, and 0.8 milligrams per milliliter 
pepstatin. The centrifiigation and re-suspension was repeated three times to wash the beads 
substantially free of non-specific lysate proteins. Suspended beads were added to a 14 milliliter 
sterile polypropylene tube, diluted with Binding Buffer to approximately 50 microliters of 
10 packed glutathione beads per milliliter and incubated at 4°C on a rocking platform for 30 
O minutes in order to allow BSA to coat the beads„ 500 milliliters of these coated GST-fusion 

hQ protein associated glutathione beads, which comprised about 10-50 nanograms of bound 

^ GST-fusion protein, was then aliquoted into 1 .5 miUiliter microfuge tubes. GST-fusion protein 

% expression levels were quantitated by Coomassie Brilliant Blue staining of protein separated by 

'0 15 SDS-PAGE gels, using BSA as a standard. 

Q [0283] In vitro transcription and translation (I VTT) reactions involving S-Methionine 

^l were performed with pET-hMSH(Y) (i.e. where 'Y' was 2, 3, or 6) using purified DNA 

5]f according to the manufacturers recommendations. IVTT reactions were pre-run to determine 

1^*^ the relative molar concentration of each construct. This value v^as calculated using the specific 

20 activity of ^^S-Methionine, correcting for the number of methionine residues in each IVTT 
construct and using SDS-PAGE and a Molecular Dynamics Phosphorlmager device equipped 
with ImageQuant software (Sunnyvale, CA). Up to 10 microliters of the IVTT protein was 
added to each tube such that each sample contained an approximately equimolar concentration 
of IVTT protein. An IVTT reaction which used pET24d as the vector was added to normalize 
25 the total amount of IVTT mixture in each tube. The tubes were incubated for at least one hour 
at 4'^C on a rocker. The beads were washed three times with the Binding Buffer and 
resuspended in 50 microliters of SDS loading buffer, which consisted of 0.25 Tris, pH 6.8, 5% 
(w/v) sucrose, 2% (w/v) SDS, 5%) (v/v) 2-mercaptoethanol, and 0.005% (w/v) bromophenol 
blue. Samples were resolved by SDS-PAGE, and imaged using the Molecular Dynamics 
30 Phosphorlmager (Suimyvale, CA). 
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[0284] It is recognized that the GST-IVTT interaction assay system is not quantitative, and 
may depend on the relative association constant (k^ss^c) which is related to the concentration 
of interacting peptides. Thus, subtle changes in the relative peptide concentrations may obscure 
potentially altered interactions. In order to provide control for such concentration-dependent 
5 processes between experiments, the molar concentration of the GST-fusion protein and the 
molar concentration of the IVTT sample were determined. Furthermore, clear changes in 
interaction between hMLHl and hPMS2 were observed by the inventors using a similar assay 
system that correlates with alterations known to be mutations, rather than polymorphisms. 
[0285] The results of the experiments presented in this Example are now described. 

10 

O GST Interaction Assay 

i|3 [0286] As described elsewhere herein, a physical interaction may be demonstrated between 

S hMSH2 and either of hMSH3 and hMSH6 using immunoprecipitation (IP) reactions v^ith 

anti-hMSH2 antibodies, which have been described in the art and are pxiblicly available. 
''G 15 However, interaction-region mapping experiments using truncation mutants of hMSH3 and 
^ hMSH6 resuhed in elevated background as a result of anti-hMSH2 antibody binding to the 

rfl truncated probes. In addition, this IP assay did not appear sensitive enough to detect weak 

interactions. 

[0287] For these reasons, the alternative assay described herein was developed. This assay 
20 relies on the use of a GST-fusion protein expressed in E. coli as a "bait" and in vitro 

transcribed and translated (IVTT) protein as "prey". This assay proved to be effective for all of 
the GST-fusion MutS homolog probe combinations used in the studies described in this 
application. These GST-fusion MutS homolog probe combinations included 
GST-hMSH2:IVTT-hMSH3, GST-hMSH3 :1 VTT-hMSH2, GST-hMSH2:IVTT-hMSH6, and 
25 GST-hMSH6:IVTT-hMSH2. The interaction for each of these IVTT full-length peptides was 
specific for the corresponding GST-hMSH(X) fusion protein, as evidenced by the observation 
that nearly undetectable background non-specific binding was demonstrated by incubation and 
centrifugal precipitation of the IVTT-MSH(Y) with: 
1) GST-agarose beads alone; 
30 2) E. coh lysate +GST-agarose beads; and 
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3) pGEX (the GST moiety alone) +GST-agarose beads as controls. Furthermore, 
densitometric comparison of the PAGE lanes containing only pGEX with PAGE lanes 
containing GST-hMSH(X) demonstrated that the signal-to-background ratio in this assay 
approaches 100. These results suggested that this bait-prey system was sufficient to map the 
5 interaction regions of the hMSH2-hMSH3 and the hMSH2-hMSH6 heterodimers. 

[0288] In these studies, a clear interaction between MSH homologs could be demonstrated 
by comparing association of GST alone and IVTT-MSH(Y) with association of GST-MSH(X) 
and IVTT-MSH(Y), where X and Y are independently 2, 3, or 6. Furthermore, this assay 
provided a qualitative measure of interaction efficiency, because each experiment contained a 
10 nearly identical molar ratio of GST-MSH(X) and IVTT-MSH(Y). In addition, the 

0 GST-hMSH3 and GST-hMSH6 fusion proteins were demonstrated to be active for mis-pair 

^fi binding when they are combined with purified hMSH2. These results indicate that the structure 

of the hMSH3 and hMSH6 proteins is not substantially altered by fusion to GST. 

1 15 Interaction Regions of hMSH2 and hMSH3 

[0289] The regions of hMSH3 which interact with hMSH2 were determined, as illustrated 
in Figure 12. Truncated hMSH3 polypeptides were constructed such that the protein was 
represented by three overlapping polypeptides^ as illustrated in Figure 12, polypeptides 2-4. It 
was determined that there are two separate regions of hMSH3 that interact with hMSH2. It was 

20 recognized that an amino-terminal region of hMSH3 and a carboxy-terminal region of hMSH3 
are involved in interactions with hMSH2, as illustrated, for example, by the abilities of 
polypeptides 5 and 10 in Figure 12 to interact with GST-hMSH2. The amino-terminal region 
was determined to be located within the region of hMSH3 bounded by amino acid residues 126 
and 250, as indicated by the abilities of polypeptides 6-9 in Figure 12 to interact with GST- 

25 hMSH2. Because the level of IVTT expression was insufficient for polypeptides comprising 

fewer than one hundred amino acids, the carboxy-terminal region was mapped using an internal 
deletion strategy. Using this strategy, the carboxy-terminal interaction region was determined 
to be located within the region of hMSH3 bounded by amino acid residues 1050 and 1 128, as 
indicated by the abilities of polypeptides 10-14 in Figure 12 to interact with GST-hMSH2. 
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[0290] The locations of regions of hMSH2 which interact with hMSH3 were determined in 
a similar fashion. The regions of hMSH2 which interact with hMSH3 were determined, as 
illustrated in Figure 13. Truncated hMSH2 polypeptides were constructed such that the protein 
was represented by four overlapping polypeptides, as illustrated in Figure 13, polypeptides 2-5. 
5 It was determined that hMSH2 comprises two regions which are involved in interaction with 
hMSH3, as indicated by the abilities of polypeptides 1-6 in Figure 13 to interact with GST- 
hMSH3. An amino-terminal region was determined to be located within the region of hMSH2 
bounded by amino acid residues 378 and 625, as indicated by the abilities of polypeptides 7-10 
in Figure 13 to interact v^th GST-hMSH3. The amino acid boundaries of the carboxy-terminal 
10 interaction region of hMSH2 were not resolved with precision, due to sub-optimal signal 
G strength. Nonetheless, the data illustrated in Figure 13 indicate that the carboxy-terminal 

hf} interaction region of hMSH2 may at least be localized in the region bounded by amino acid 

residues 751 and 934 (the carboxy terminus), as indicated by the abilities of polypeptide 6 in 
::| Figure 13 to interact with GST-hMSH3. 

^0 15 [0291] Because there were two interaction regions between hMSH2 and hMSH3, a system 
O was designed to determine the linear orientation of the two regions. GST fusion proteins 

^ comprising truncated hMSH3 polypeptides were constructed. A GST-hMSH3 fusion protein 

W comprising hMSH3 amino acid residues 1-297 comprised the consensus amino-terminal 

interaction region. A GST-hMSH3 fusion protein comprising hMSH3 amino acid residues 
20 1025-1 128) comprised the consensus carboxy-terminal interaction region. These two fusion 
proteins were used as "bait" against a series of hMSH2 "prey" truncation mutants. We found 
that non-truncated hMSH2 interacted with both the GST-hMSH3 fusion protein, as indicated by 
the ability of polypeptide 1 in Figure 14 to interact with both the GST-hMSH3 fusion protein 
comprising the consensus amino-terminal interaction region and the GST-hMSH3 fusion 
25 protein comprising the consensus carboxy-terminal interaction region. The GST-hMSH3 
fusion protein comprising the consensus amino-terminal interaction region interacted most 
strongly with amino acid residues 251-750 of hMSH2 protein, as indicated by the ability of 
polypeptide 4 in Figure 14 to interact with this GST-hMSH3 fusion protein. The GST-hMSH3 
fusion protein comprising the consensus carboxy-terminal interaction region interacted most 
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strongly with amino acid residues 751-934 of hMSH2 protein, as indicated by the ability of 
polypeptides 5, 6, 7, and 8 in Figure 14 to interact with this GST-hMSH3 fusion protein. 
[0292] These resuhs indicate that the amino-terminal interaction region of hMSH3 normally 
interacts with the amino-terminal interaction region of hMSH2 and that the carboxy-region 
5 interaction region of hMSH3 normally interacts with the carboxyl region interaction region of 
hMSH2. Use of the GST-hMSH3 fusion protein comprising the consensus carboxy-terminal 
interaction region permitted further resolution of the carboxy-terminal interaction region of 
hMSH2. It was determined that the carboxy-terminal interaction region of hMSH2 is bounded 
by amino acid residues 875 and 934, as indicated by the ability of polypeptide 8 in Figure 14 to 
10 interact with this GST-hMSH3 fusion protein. 

J Interaction Regions of hMSH2 and hMSH6 

I'; [0293] Using a similar strategy, the locations of the interaction regions of hMSH2 and 

hMSH6 were determined. It was recognized that an amino-terminal region of hMSH6 and a 
h§ 15 carboxy-terminal region of hMSH6 are involved in interactions with hMSH2, as illustrated, for 
ir'i example, by the abilities of polypeptides 1-6 in Figure 15 to interact with GST-hMSH2. The 

amino-terminal region was determined to be located within the region of hMSH6 bounded by 
iV amino acid residues 326 and 575, as indicated by the abilities of polypeptides 7-10 in Figure 15 

1=^^^; to interact with GST-hMSH2. The carboxy-terminal region was determined to be located 

20 within the region of hMSH6 bounded by amino acid residues 953 and 1360, as indicated by the 
abilities of polypeptide 6 to interact with GST-hMSH2. 

[0294] The regions of hMSH2 which interact with hMSH6 were determined, as illustrated 
in Figure 16. Truncated hMSH2 polypeptides were constructed such that the protein was 
represented by four overlapping polypeptides, as illustrated in Figure 16, polypeptides 2-5. It 

25 was determined that hMSH2 comprises two regions which are involved in interaction with 
hMSH6, as indicated by the abilities of polypeptides 1-6 in Figure 15 to interact with GST- 
hMSH6. The amino-terminal region was determined to be located within the region of hMSH2 
bounded by amino acid residues 378 and 625, as indicated by the abilities of polypeptides 7-10 
in Figure 15 to interact with GST-hMSH6. Using a GST fusion protein which contained a 

30 truncation mutant of hMSH6 comprising amino acid residues 1302-1360, it was determined that 
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the carboxyl terminal interaction region of hMSH2 is located within the region of hMSH2 
bounded by amino acid residues 875 and 934^ as indicated by the ability of polypeptide 8 in 
Figure 17 to interact with this GST fusion protein. The ability of polypeptide 8 in Figure 17 to 
interact with this GST fusion protein also indicates that the carboxy-terminal interaction region 
5 of hMSH6 is bounded by amino acid residues 1302 and 1360. 

[0295] These results indicate that the same amino acid regions of hMSH2 are involved in 
the interactions between hMSH2 and hMSH3 and the interactions between hMSH2 and 
hMSH6. 

[0296] The linear orientation of the hMSH2-hMSH6 interaction regions was determined. 
10 Using IVTT amino-terminal and carboxy-terminal hMSH2 interaction regions and GST fusion 
^S proteins comprising the amino-terminal and carboxy-terminal interaction regions of hMSH6, it 

was determined that the amino-terminal interaction region of hMSH6 interacts with the 

amino-terminal interaction region of hMSH2, as indicated by the ability of polypeptides 3-5 in 
'^U Figure 17 to interact with the GST fusion protein comprising the amino-terminal interaction 

^ig 15 region of hMSH6. It v^as further determined that the carboxy-terminal interaction region of 
^-k hMSH6 interacts with the carboxy-terminal interaction region of hMSH2, as indicated by the 

ability of polypeptides 5-8 in Figure 17 to interact v^ith the GST fusion protein comprising the 
i 'U carboxy-terminal interaction region of hMSH6, Thus^ the linear orientation of the interaction 

regions of the hMSH2:hMSH6 heterodimer is identical to that of the hMSH2:hMSH3 
20 heterodimer. 



Interaction Regions of hMSH2 vnth Itself 

[0297] hMSH2 homodimers bind mismatched duplex DNA (Acharya et al, 1 996, Proc. 
Natl. Acad. Sci. USA 93:13629-13634). Using a GST-hMSH2 fusion protein comprising 
25 hMSH2 amino acid residues 751-934, it was determined that this portion of hMSH2 (i.e. the 
carboxy-terminal interaction region) interacts with the carboxy terminus of hMSH2. Thus, the 
hMSH2 homodimer exhibits the same carboxy-terminal interaction pattern that was observed 
between hMSH2 and either of hMSH3 and hMSH6. 
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The Effect of hMSH2 Mutations Observed in HNPCC Kindreds on hMSH(X):hMSH(Y) 
Interaction 

[0298] Several HNPCC-associated missense mutations have been described which are 
located within one of the two interaction regions of hMSH2 identified herein. Six of these 
5 HNPCC-associated mutations were constructed, and the effect of the mutations on 

hMSH(X):hMSH(Y) interaction were investigated, wherein X and Y are independently 2, 3, or 
6. The six HNPCC-associated mutations were those designated L390V, K393M, R524P, 
N596D, P622L, and T905R, These mutations are described in the HNPCC database 
(Peltomalei et al, 1997). 

10 [0299] Interaction experiments were performed using mutated hMSH2 fragments which 
O comprised either only an amino-terminal interaction region or a carboxy-terminal interaction 
%0 region to eliminate any confusion that the presence of multiple interaction regions might 

% generate. These hMSH2 IVTT mutant consensus interaction regions were examined for 

interaction with GST fusion proteins which comprised either full length hMSH3 or full length 
^ 0 15 hMSH6. No difference could be discerned between binding of any mutated hMSH2 fragment 
Q to either of the fusion proteins and binding of a corresponding wild type hMSH2 fragment to 

-ff: either of the fusion proteins. These results suggest that altered interaction between hMSH2 and 
;;]^ either hMSH3 or hMSH6 are not likely to be causative functional defects resulting in HNPCC. 

1*^ [0300] The results of the experiments described in this Example suggest a model for 

20 regional interactions of hMSH2 with hMSH3 and with hMSH6. This model is illustrated in 
Figure 18. The results described herein indicate that hMSH2 employs the same interaction 
regions, regardless of whether it interacts with hMSH3 or with hMSH6. These interactions are 
mediated by two distinct regions of hMSH25 an amino-terminal interaction region bounded by 
amino acid residues 378 and 625 and a carboxy-terminal interaction region bounded by amino 
25 acid residues 875 and 934. The adenine nucleotide binding region and the putative 

helix-tum-helix motif of hMSH2 are not contained within either of these regions. Thus, the 
results described in this Example indicate that it is unlikely that helix-turn-helix is essential for 
interaction of hMSH2 with liMSH3 or with hMSH6. Figure 18 illustrates both the relative 
positions and the linear orientation of the interaction regions of hMSH2, hMSH3, and hMSH6. 
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[0301] Since hMSH3 and hMSH6 appear to contact hMSH2 within the same binding 
regions, the amino terminal and carboxyl terminal regions of hMSH3 and hMSH6 were aligned 
and compared. The amino terminal interaction regions of hMSH3 and hMSH6 exhibited little 
identifiable homology. The carboxyl terminal interaction regions of hMSHS and hMSH6 
5 exhibited moderate homology, 16 of 60 residues being identical. The carboxyl-terminal regions 
of hMSHS and hMSH6 may provide a conserved function for these proteins such as, but not 
limited to, protein-protein interaction. 

Example 5 

hMSH5, A Human MutS Homolog that Participates 
in the Second Meiotic Division 

[0302] In the experiments presented in this Example, the human MSH5 protein (hMSH5) 
and the cDNA sequence encoding it are described. The msh5 gene is located at chromosome 
6p22-21, and is involved in meiosis, as evidenced by expression of mshS in the testes and 
confinement of such expression to secondary spermatocytes and developing spermatids. 
hMSH5 specifically interacts with hMSH4, confirming the generality of fimctional 
heterodimeric interactions in eukaryotic MutS homologs. The hMSH4:hMSH5 heterodimer 
may thus be analogized with the hMSH2:hMSH3 and hMSH2:hMSH6 heterodimers. 
[0303] The materials and methods described in the experiments presented in this Example 
are now described. 

Cloning the hMSH4 and hMSH5 cDNAs 

[0304] A search of the NCBI EST database indicated that a 466-base pair sequence derived 
from Soars human fetal liver spleen cDNA (T67203) exhibited significant homology with both 
25 yeast MSH3 and yeast MSHS. The amino acid sequence of the yeast and the human MSH2 
homologs were used to screen the Human Genome Sciences (HGS, Bethesda, MD) computer 
database using TFASTA computer software designed by the Genetics computer Group (GCG, 
University of Wisconsin). The HGS database contains nucleotide sequence information of 
expressed sequence tags (ESTs) which identify a diverse collection of cDNAs derived from 
30 more than 400 cDNA libraries (Adams et aL, 1991, Science 252:1651-1656). One EST 
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(designated C4) was determined to exhibit significant homology, but not identity, to yeast and 
human MSH2 and MSH3 protein sequences. 

[0305] Two PGR fragments were amplified using primers derived from these two EST 
sequences, which were identified in cDNA derived from human testis. The PGR product were 
5 used to screen a normal human testis cDNA library (obtained from Glontech, Palo Alto, GA) 
using conventional plaque hybridization techniques. One of the primer sets derived from G4 
yielded a consistent sequence and identified numerous phage clones. This set of primers 
comprised a forward primer (5'-AGGGGATGTT CAGAGGAAT-3'; SEQ ID NO: 3 1) and a 
reverse primer (5'-TGGAGTGGGA TTGTTGAGT-3^; SEQ ID NO: 32). Six clones were 
10 identified which were amplified using these primers, and these clones were excised using the 
^3 pDR2 phagemid, according to the manufacturer's recommendations. The six clones were 

^fl subcloned into pBSK (Stratagene, La JoUa, CA), and double strand sequencing of the six clones 

. t was performed using the PRISM™ Ready Reaction DyeDeoxy Terminator Gycle Sequencing 

Kit and an Applied Biosystems 377 Sequencer (Foster City, GA). 
"■0 15 [0306] One clone, designated b29, comprised an open reading frame (ORE) 2505 base pairs 
in length. This ORE comprised one STOP codon N-terminal to the start methionine codon and 
ifl one STOP codon at a position corresponding to the G-terminus of the protein encoded by the 

ORE, The completeness of the N-terminal region of the ORE was confirmed by performing a 
RAGE reaction using human normal testis cDNA (Glontech, Palo Aho, GA), as described (Apte 
20 et al, 1993, BioTechniques 15:890-893). The EST sequence obtained fi-om NGBI (T67203) 
was found to be located in the G-terminal portion of the b29 ORE, 
[0307] Glone b29 was further subcloned into pGEX (Pharmacia, Piscataway, NJ) for 
expression of the GST fusion protein in E. coli XLl Blue (Stratagene, La JoUa, GA) and into 
pET29a (Glontech, Palo Alto, GA) for in vitro transcription and translation (IVTT) using 
25 restriction endonucleases Ndel and Not! (New England Biolabs, Beverley, MA). 

[0308] An hMSH4 clone was obtained from human testis cDNA (Glontech, Palo Alto, GA) 
by PGR amplification and subsequent ligation into the pGR2.1 vector using a TA cloning kit 
(Invitrogen, San Diego, GA). The primer sequences which were used in these reactions 
included an outer forward primer (5'-GGAAGGTTTG GGAGGATGG TGAGG-3'; SEQ ID 
30 NO: 33), a reverse primer (5'-ATTGTGATTA TTGTTGAGTG TT-3'; SEQ ID NO: 34), a 
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nested PGR: forward primer (5'-ATCTCGAGAT GCTGAGGCCT GAG-3^; SEQ ID NO: 35), 
and a second reverse primer (5^-GCGCTAGCTT ATTCTTCAGT CTTTTC-3'; SEQ ID NO: 
36). The nucleotide sequence of the amplified clone was confirmed by complete double strand 
sequencing of both strands. 
5 [0309] The hMSH4 clone contained a deletion of a C residue in codon 1 8 and an insertion 
of a G residue in codon 20, resulting in VI 9S and V20S mutations. Furthermore, the hMSH4 
clone contained a G to A mutation at base 1219 of the published sequence (numbered starting 
with the A in the ATG initiator codon), which resulted in an E407K amino acid substitution. In 
addition, an apparent polymorphism at codon 368 (CGC to AGA) was detected, which does not 
10 alter the coding Arg. 

:Ji Chromosomal Mapping of hMSH5 

> [0310] PGR reactions were performed using the primers described above respectively, to 

'^G screen the GENEBRIDGE-4™ Radiation Hybrid Panel (Hudson et al., 1 995, Science 

hQ 15 270: 1 945-1 954). 35 amplification cycles were performed using an armealing temperature of 

60°C for 30 seconds followed by 72°C for 1 minute. Fragments were visualized by agarose gel 
If^ electrophoresis. 

i 

H Northern Blotting 

20 [0311] Three multiple tissue northern blots containing poly- A + RNA obtained from a total 
of 23 different human tissues were obtained from Clontech (Palo Alto, CA), 50 nanograms of a 
full length hMSH5 cDNA and a beta-actin cDNA control were radiolabeled using alpha-( P)- 
dCTP by random primed labeling (Boehringer Mannheim, Germany). Northern Blots were 
hybridized according to the manufacturer's instructions. The blots were washed in 2 x SSC 

25 containing 0.05% (w/v) SDS at room temperature (i.e, about 20''C) for a total of 60 minutes and 
at SO'^C in 0.1 x SSC, 0.1 % (w/v) SDS for a total of 40 minutes. Phosphorimager screens were 
exposed for one day. A 2.5-2.6 kilobase transcript was detected at a high level in testis. 
Tissues with significantly lower expression levels included bone marrow, lymph nodes, brain, 
and spinal cord. 
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Antibodies 

[0312] Five different 1 5-mer peptides were synthesized, each corresponding to predicted 
immunogenic regions of the hMSH5 protein. These peptides were conjugated to hemocyanin, 
5 and polyclonal antibodies were raised in rabbits (H.T.I. Bio-Products, Ramona, CA). Antibody 
clone C924-2 was found to be most sensitive and specific in Western Blot experiments and was 
purified over a Protein-A column for Western analysis. Further affinity purification of the 
antibody was performed using a crude lysate of SF9 insect cells overexpressing hMSH5 
protein. hMSH5 protein lysate was separated by SDS-PAGE, transferred to nitrocellulose and 
10 the hMSH5 specific region excised and used to affinity purify the antibody as described 
m (Wilson et aL, 1995, Cancer Res. 55:5146-5150). 

Immunohistochemistry 

''G [0313] 5-micron sections of formalin-fixed and paraffin embedded tissues were cut onto 

15 Neoprene coated slides (Aldrich Chemicals, Milwaukee, WI). After de-paraffinization, 

including a 30 minute methanolic peroxide block for endogenous peroxidase activity (Leica 
^ Autostainer, Leica, Deerfield, IL), the slides were subjected to microwave radiation in 200 

til milliliters of Chem.Mate H.LE.R buffer, pH 5.5-5.7 (Ventana Medical Systems, Tucson, AZ) at 

high energy for 5 minutes using a Panasonic Microwave #NN-5602A (Franklin PK, IL). 50 
20 milliliters of water were replaced for additional microwave exposure for 4 minutes at high 
energy . 

[0314] Immunostaining using the catalyzed signal amplification system (DAKO™, 
Carpinteria, CA) was performed according to the manufacturer's instructions. Incubation with 
Protein-A and hMSH5 specific affinity purified polyclonal antibody was performed for 50 
25 minutes at room temperature at concentrations of 1 :800 or 1 :2000, respectively, using the 
hMSH2 polyclonal antibody. For counter staining with Harris Hematoxylin (Surgipath, 
Richmond, IL), the Leica Autostainer was used. 
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GST Fusion Protein Interaction assay 

[0315] 500 microliters taken from a 5 milliliter overnight starter cultvire of cells which 
expressed an hMSH2-, hMSH3-, hMSH5-, or hMSH6-pGEX-fusion protein with (or non-fused 
5 pGEX as a negative control) was inoculated into 50 milliliters of Luria broth which contained 
50 micrograms per milliliter ampicillin, and this culture was grown until the optical density at 
600 nanometers was about 0.5. Protein expression was induced by addition of 0.1 millimolar 
(final concentration) IPTG for 2 hours at 30°C. Cells were pelleted and resuspended in 750 
microliters of phosphate buffered saline containing protease inhibitors. A 10 minute digestion 
10 on ice using 1 milligram per milliliter lysozyme was then performed. After the addition of 
O 0.2% (v/v) Triton X-100 and 1 millimolar dithiothreitol (final concentrations), the lysate was 

snap-frozen in liquid nitrogen and thawed twice. DNasel (200 units per milliliter; Boehringer 
'|S Mannheim, Germany) digestion was performed using the thawed lysate for 30 minutes on ice, 

after which cell debris was removed by centriftigation at 14,000 rpm at 4°C for 30 minutes in a 
15 benchtop microfuge. Equal amounts of ly sates obtained from cultures which separately 
IP expressed one of the fusion proteins (or GST alone as a negative control) w^ere incubated on a 

Jil rocking platform for 1 hour at 4°C in the presence of 2 milligrams of glutathione-agarose beads 

(Sigma Chemical Co., St. Louis, MO) which had been pre-swoUen in phosphate buffered saline 
1^^^ containing protease inhibitors for 1 hour at room temperature. The beads were washed three 

20 times with 500 microliters of Interaction Buffer, which comprised 20 millimolar Tris-HCl, pH 
7.5, 10% (v/v) Glycerol, 150 millimolar NaCl, 0.1% (v/v) Tween 20, 5 millimolar EDTA, I 
millimolar DTT, 0.75 milligrams per milliliter bovine serum albumin (Amresco, Solon, OH), 
and proteinase inhibitors). The beads were subsequently incubated in Interaction buffer for 1 
hour at 4°C on a rocking platform. 
25 [0316] In vitro transcriptions and translation (IVTT) reactions were performed using 1 

microgram each of hMSH2, hMSH3, hMSH5, and hMSH6 inserts (separately) in pET vectors 
and using the hMSH4 insert in pCR 2,1 using the TNT coupled reticulocyte lysate system 
(Promega, Madison, WI) according to the manufacturer's protocol. About 40 microcuries of 
S-methionine was incorporated into each protein. 5 microliters of individual IVTTs was 
30 added to 500 microliters of glutathione-agarose beads in Interaction buffer, and the mixture was 
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incubated for 1 hour at 4°C on the rocking platform. After three final washing steps, the 
supernatant was removed, and the beads were resuspended in 35 microliters of 2 x Spear's 
buffer, boiled for 5 minutes, and centrifuged for 5 minutes at 14,000 rpm in a benchtop 
microfuge. 15 microliters of each reaction mixture was loaded onto separate lanes of an 8% 
5 (w/v) SDS-PAGE Gel (BioRad MiniProtean II), and electrophoresis was performed for about 
90 minutes at 135 volts. Phosphorlmager screens (Molecular Dynamics) were exposed to the 
dried gels for one day. 

[0317] The results of the experiments described in this Example are now described. 

Isolation an chromosomal map of hMSH5, a new human MutS homolog 
[0318] Six clones which contained the EST later determined to correspond to msh5 were 
isolated, and the nucleotide sequence of both strands of the clone inserts were determined. 
Sequence analysis of clone b29 indicated the presence of an ORF 2505 base pairs in length. 
This ORF encoded putative 834-amino-acid protein, as indicated in Figures 19A-19C. The 
predicted molecular weight of the protein is 97 kilodaltons. A STOP codon was identified 
beyond the N-terminal end of the ORF, in the non-coding region, and the completeness of the 
ORF was confirmed by T-RACE analysis. 

[0319] The Genebridge-4 Radiation Hybrid Panel for PGR products having a length 
corresponding to this ORF. In this way, the mshS gene was located 6,94cR from D6S478 on 
chromosome 6p22.1-21.3. 

MSH5 defines a new family of MutS homologs involved in sporulation and meiosis 
[0320] Of all eukaryotic and prokaryotic MutS homologs, the b29 clone was found to be 
most closely related to Caenorhabdis eiegans MSH5 (29% amino acid identity) and 
25 Saccharomyces cerevisiae MSH5 (25% amino acid identity). A region encompassing the 

adenine nucleotide binding domain displayed approximately 60% amino acid identity among 
these homologues. The gene was therefore designated human msh5. 
[0321] Among MutS homologs, the next closest relatives to hMSH5 are the MSH2 
proteins. hMSH3 and hMSH6 proteins appear to be less closely related to hMSH5 than are the 
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bacterial MutS proteins. In the present alignment, the MSH4 proteins appear to be the most 
divergent of the MutS homologs. 

Expression of hMSH5 

[0322] Human mshS was determined to be transcribed at a high level in testis (Figure 3). 
These results correspond to the observation that, in yeast, MSHS expression was meiosis 
specific (Hollingsworth et aL, 1995, Genes Dev„ 9:1728-1739). The size of the human 
transcript corresponded to the length of the cDNA sequence, which is 2.5 kilobases. The 
presence of hMSH5 was detected in testis and tonsil tissue and, at very low levels, in two T- 
and B-cell tumor lines (Jurkat, CEM, Daudi, and GM 1500 cell lines) by Western Blot analysis. 
The Western signal in these autopsy tissues revealed low molecular weight protein band(s) that 
were likely degradation products of the significant autolytic reactions occurring in these 
samples. mshS expression was also observed in human bone marrow and lymph node tissues. 
The presence of msh5 transcript in human tissues where B- and T-cells develop as well as 
expression in the T- and B-cell lines suggests a relationship to cellular development processes 
that include recombination events. However, it is also possible that the low levels of hMSH5 
protein expression in the B- and T-cell lines could result fi*om the fact that the cell lines are 
derived from hematologic malignancies and thus do not represent normal B- and T-cell 
precursors or other undefined factors. hMSH5 expression may also occur in human brain, 
spinal cord, and trachea tissues. 

[0323] Western analysis suggested that several of the purified polyclonal antibodies derived 
from synthetic peptides are useful use immunohistochemical (IHC) studies. IHC stains for 
surgical specimens obtained from patients with various testicular tumors exhibited nuclear 
expression of hMSH5 in spermatids in statu nascendi in round and elongated spermatids (S3). 
In contrast, all of the preceding phases of spermatogenesis, as well as the spermatozoa 
themselves exhibited no expression of hMSH5. These observations indicate that hMSH5 has a 
specific role in the processes associated with the second meiotic division. 
[0324] The testicular histology of the orchiectomy specimens was not entirely normal. 
Thus, it is possible that hMSH5 was abnormally expressed in the testicular samples obtained 
fi-om surgical patients. In the samples examined, histological examination revealed occasional 
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intratubular neoplasia and the presence of discrete lymphocytic infiltrates. However, 
spermatogenesis in these samples was still functioning sufficiently to produce mature sperm 
cells and a number of tubules were found where there was no evidence of neoplasia. 
Furthermore, staining of spermatids was evident in all of the tubules that appeared normal 
5 based on the presence of all stages of spermatogenesis. Textbook examples of normal tubules 
would show the cell types of spermatogenesis filling the entire tubule. 
[0325] In contrast, hMSH2 is expressed in the nuclei at nearly all phases of 
spermatogenesis except for the round and elongated spermatids (where hMSH5 is expressed) 
and the spermatozoa. Sertoli cells exhibit faint nuclear staining with hMSH2-specific antibody. 
10 hMSH2 expression in tissue is clearly correlated with proliferation in general, which is 
13 exemplified in the experiments described in this Example by nuclear expression of h]V[SH2 in 

h§ the seminoma. In addition, tissues that were positive for hMSH2 expression were also positive 

^fc for expression of the proliferation marker Ki67. hMSH5 protein expression was absent in 

]^ seminoma and other testicular malignancies such as embryonal cell carcinoma and mature and 

15 immature teratoma. Expression of hMSH5 was absent in dividing spermatogonium A, 
suggesting that expression is not induced during mitosis. 

IIJ 

^ Protein Interaction Studies 

[0326] Because hMSH2, hMSH3 and hMSH6 are, as described herein, known to act as 
20 heterodimers, interaction studies of hMSH5 with hMSH2, hMSH3, hMSH4 and hMSH6 were 
performed. 

[0327] hMSH2 interacts strongly with hMSH3 and hMSH6, as described herein in Example 
1. IVTT-hMSH5 did not interact with GST-hMSH2, -hMSH3 or hMSH6 fusion proteins. 
Similarly, none of IVTF-hMSH2, -hMSH3, and -hMSH6 interacted with GST-hMSH5. The 

25 lack of interaction of hMSH5 with hMSH2, hMSH3, and hMSH6 was confirmed as the 
intensity of the bands never exceeded the background. However, there was significant 
interaction of GST-hMSH5 with IVTT-hMSH4. Furthermore, a significant interaction of GST- 
hMSH3 fusion protein with IVTT-hMSH4 was observed. However, this potential interaction 
could not be confirmed since significant amounts of soluble GST-hMSH4 fusion protein could 

30 not be obtained. These results suggest that hMSH5 specifically interacts with hMSH4 alone. 
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[0328] In yeast, msh5 mutants have decreased spore viability, increased levels of Meiosis I 
chromosomal nondisjunction and decreased levels of reciprocal exchange between, but not 
within, chromosomes (HoUingsworth et al, 1995, Genes Dev. 9:1728-1739). This observation, 
combined with the results described herein suggest that hMSH5, and thus also hMSH4, is 
5 involved in meiotic processing. hMSH5 is located on chromosome 6p22-21 and is expressed at 
very high levels in the testis where meiosis occurs continually throughout adult life, 
Immunohistochemical examination of testicular sections revealed that the protein expression of 
hMSH5 occurred in developing round and elongated spermatids. Spermatogonia and primary 
spermatocytes did not express hMSH5, and expression of hMSH5 ended abruptly upon 
10 development of mature sperm. Because the expression of hMSH5 is exceedingly strong in the 

O round spermatocytes, it is likely that expression of hMSH5 begins in the secondary 

spermatocyte. The expression pattern of hMSH5 is consistent with the phenotypes exhibited in 
yeast, since the meiosis I chromosomal non-disjunction occurs at the cellular division between 

Jif the primary and secondary spermatocyte, at the stage where the expression of hMSH5 is likely 

h§ 15 to be initiated. 

[0329] The observations described herein that hMSH5 was expressed in human tissues such 

|1 as bone marrow and lymph nodes, where T-cell and B-cell development takes place, suggests 

IJJ that hMSH5 has a role in development of B-cells, T-cells, or both, and that defects in hMSH5 

might result in hematological defects. 
20 [0330] hMSH5 appears to specifically interact with hMSH4. No interaction with hMSH5 
above background was observed for hMSH2, hMSH3 or hMSH6. Thus, it is likely that the 
hMSH4-hMSH5 heterodimer is specific and constitutes a functional interaction that is separate 
from hMSH2-hMSH3 and hMSH2-hMSH6 heterodimers. Based on the conservation of the 
adenine nucleotide binding and hydrolysis domain, it is likely that the hMSH4-hMSH5 
25 heterodimer also functions as a molecular switch (Gradia et al., 1997, Cell 91:995-1005). 

[0331] The disclosures of each and every patent, patent application, and publication cited 
herein are hereby incorporated herein by reference in their entirety. 

[0332] While this invention has been disclosed with reference to specific embodiments, it is 
apparent that other embodiments and variations of this invention may be devised by others 
30 skilled in the art without departing from the true spirit and scope of the invention. The 
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appended claims are intended to be construed to include all such embodiments and equivalent 
variations. 
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